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(57) Abstract 



DNA constructs, Pichia methanolica cells, and methods for producing polypeptides using the DNA constructs and cells are disclosed. 
The DNA constructs comprise a transcription promoter of a Pichia methanolica gene, a DNA segment encoding a polypeptide heterologous 
to P. methanolica, a transcription terminator of a P. methanolica gene, and a selectable marker. P, methanolica cells containing the DNA 
constructs are used within methods for producing polypeptides heterologous to P. methanolica, DNA molecules comprising transcription 
promoters of P. methanolica genes arc also disclosed. 
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COMPOSITIONS AND METHODS FOR PRODUCING HETEROLOGOUS 
5 POLYPEPTIDES IN PICHIA METHANOLICA 

Background of the Invention 

10 Methylotrophic yeasts are those yeasts that are 

able to utilize methanol ^s a sole source of carbon and 
energy. Species of yeasts that have the biochemical pathways 
necessary for methanol utilization are classified in four 
genera, Hansenula, Pichia, Candida, and Torulopsis . These 

15 genera are somewhat artificial, having been based on cell 
morphology and growth characteristics, and do not reflect 
close genetic relationships (Billon -Grand, Mvcotaxon 
35:201-204, 1989; Kurtzman, Mvcoloaia 84:72-76, 1992). 
Furthermore, not all species within these genera are capable 

2 0 of utilizing methanol as a source of carbon and energy. As a 

consequence of this classification, there are great 
differences in physiology and metabolism between individual 
species of a genus . 

Methylotrophic yeasts are attractive candidates for 
25 use in recombinant protein production systems. Some 

methylotrophic yeasts have been shown to grow rapidly to high 
biomass on minimal defined media. Certain genes of 
methylotrophic yeasts are tightly regulated and highly 
expressed under induced or de-repressed conditions, suggesting 

3 0 that promoters of these genes might be useful for producing 

polypeptides of commercial value. See, for example, Faber et 
al.. Yeast 11:1331, 1995; Romanes et al . , Yeast 8:423, 1992; 
and Cregg et al . , Bio/Technoloov 11:905, 1993, 

Development of methylotrophic yeasts as hosts for 
3 5 use in recombinant production systems has been slow, due in 
part to a lack of suitable materials (e.g., promoters, 
selectable markers, and mutant host cells) and methods (e.g.. 
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transformation techniques) . The most highly developed 
methylotrophic host systems utilize Pichia pastoris and 
Hansenula polymorpha (Faber et al . , Curr . Genet . 25:305-310, 
1994; Cregg et al . , ibid.; Romanos et al . , ibid,; U.S. Patent 
5 No. 4,855,242; U.S. Patent No. 4,857,467; U.S. Patent No. 
4,879,231; and U.S. Patent No . 4,929,555). 

There remains a need in the art for methods of 
transforming additional species of methylotrophic yeasts and 
for using the transformed cells to produce polypeptides of 
10 economic importance, including industrial enzymes and 

pharmaceutical proteins. The present invention provides such 
methods as well as other, related advantages. 

Summary of the Invention 

15 The present invention provides DNA constructs, 

cells, and methods for heterologous polypeptides in Pichia 
methanolica cells . 

Within one aspect of the invention, there is 
provided a DNA construct comprising the following operably 

20 linked elements: (1) a transcription promoter of a Pichia 
methanolica gene; (2) a DNA segment encoding a polypeptide 
heterologous to P. methanolica; (3) a transcription terminator 
of a P. methanolica gene; and (4) a selectable marker. Within 
one embodiment, the transcription promoter is a promoter of a 

25 methanol -inducible P. methanolica gene. Methanol -inducible 
genes include, without limitation, genes encoding alcohol 
oxidase, dihydroxyacetone synthase, formate dehydrogenase, and 
catalase genes. Within certain preferred embodiments, the 
promoter is an alcohol oxidase gene comprising a sequence of 

3 0 nucleotides as shown in SEQ ID NO: 2 from nucleotide 24 to 

nucleotide 1354 or a sequence of nucleotides as shown in SEQ 
ID NO: 9 from nucleotide 91 to nucleotide 169. Within another 
embodiment, the selectable marker is a P. methanolica gene. 
Within a further embodiment, the selectable marker is a P. 

3 5 methanolica ADE2 gene. Within a preferred embodiment, the 

ADE2 gene comprises a sequence of nucleotides as shown in SEQ 
ID NO:l from nucleotide 407 to nucleotide 2851. Within an 
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alternative embodiment, the selectable marker is a gene from a 
fungus other than P. methanolica, such as the yeast 
Saccharomyces cerevisiae. Within another alternative 
embodiment, the selectable marker is a dominant selectable 
5 marker. Within an additional embodiment, the DNA construct is 
a linear molecule. 

Within a second aspect of the invention there is 
provided a P, methanolica. cell containing a DNA construct as 
disclosed above, wherein the cell expresses the DNA segment. 

10 Within one aspect, the DNA construct is integrated into the 
host cell genome. 

Within a third aspect, the present invention 
provides a method for producing a polypeptide heterologous to 
P. methanolica, e.g., a higher eukaryotic polypeptide, 

15 comprising culturing a P. methanolica cell containing a DNA 
construct as disclosed above and recovering the polypeptide 
encoded by the DNA segment . Within this aspect comprises the 
steps of culturing a P, methanolica. cell into which has been 
introduced a heterologous DNA construct as disclosed above 

20 under conditions in which the DNA segment is expressed, and 

recovering the polypeptide encoded by the DNA segment- Within 
one embodiment, the culturing step is carried out in a liquid 
medium comprising methanol and a repressing carbon source. 
Within a second embodiment, the cells are cultured to a 

25 density of from 80 to 400 grams of wet cell paste per liter 
prior to recovering the polypeptide. Within a further 
embodiment, the polypeptide is a human polypeptide. Within an 
additional embodiment, the heterologous DNA molecule is a 
linear DNA molecule. 

3 0 Within a fourth aspect, the present invention 

provides a DNA molecule comprising a transcription promoter of 
a P. methanolica gene, wherein the DNA molecule is essentially 
free of coding sequence of the gene. Within a related aspect, 
there is provided a DNA molecule comprising a transcription 

35 promoter of a P. methanolica gene, wherein the promoter is 

operably linked to a DNA segment encoding a protein other than 
a P. methanolica protein. Within one embodiment, the promoter 
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is a promoter of a methanol -inducible gene. As disclosed 
above, methanol -inducible genes include, without limitation, 
alcohol oxidase, dihydroxyacetone synthase, formate 
dehydrogenase, and catalase genes. Within other embodiments, 
5 the promoter comprises a sequence of nucleotides as shown in 
SEQ ID NO: 2 from nucleotide 24 to nucleotide 1354 or as shown 
in SEQ ID NO : 9 from nucleotide 91 to nucleotide 169. 

Another aspect of the invention provides a method 
for preparing Pichia methanolica cells containing a 

10 heterologous DNA construct as disclosed above. The DNA 

construct is introduced into P. methanollca cells, the P. 
methanolica cells are cultured under conditions that select 
for growth of cells containing the DNA construct; and cells 
that grow under the selective conditions are recovered. 

15 These and other aspects of the invention will 

become evident upon reference to the following detailed 
description and the attached drawings. 



2 0 Brief Description of the Drawings 

Fig. 1 illustrates the effects of field strength 
and pulse duration on electroporation efficiency of P. 
me thanol i ca . 

Fig. 2 is a schematic diagram of a recombination 
25 event between plasmid pCZR140 and P. methanolica genomic DNA. 

Fig. 3 is a schematic diagram of a recombination 
event between plasmid pCZR137 and P. methanolica genomic DNA. 

Detailed Description of the Invention 

3 0 Prior to setting forth the invention in more 

detail, it will be useful to define certain terms used herein: 
A "DNA construct" is a DNA molecule, either single- 
or double -stranded, that has been modified through human 
intervention to contain segments of DNA combined and 
35 juxtaposed in an arrangement not existing in nature. 



wo 97/17450 



PCT/US96/17944 



5 

"Early log phase growth" is that phase of cellular 
growth in culture when the cell concentration is from 2 x 10^ 
cells/ml to 8 X 10^ cells/ml. 

"Heterologous DNA" refers to a DNA molecule, or a 
5 population of DNA molecules, that does not exist naturally 
within a given host cell. DNA molecules heterologous to a 
particular host cell may contain DNA derived from the host 
cell species so long as that host DNA is combined with 
non-host DNA. For example, a DNA molecule containing a 
10 non-host DNA segment encoding a polypeptide operably linked to 
a host DNA segment comprising a transcription promoter is 
considered to be a heterologous DNA molecule . 

A "higher eukaryotic" organism is a multicellular 
eukaryotic organism. The term encompasses both plants and 
15 animals. 

"Integrative transf ormants" are cells into which 
has been introduced heterologous DNA, wherein the heterologous 
DNA has become integrated into the genomic DNA of the cells. 

"Linear DNA" denotes DNA molecules having free 5' 
20 and 3' ends, that is non-circular DNA molecules. Linear DNA 
can be prepared from closed circular DNA molecules, such as 
plasmids, by enzymatic digestion or physical disruption. 

The term "operably linked" indicates that DNA 
segments are arranged so that they function in concert for 
25 their intended purposes, e.g., transcription initiates in the 
promoter and proceeds through the coding segment to the 
terminator . 

The term "promoter" is used herein for its 
art -recognized meaning to denote a portion of a gene 

3 0 containing DNA sequences that provide for the binding of RNA 
polymerase and initiation of transcription. Promoter 
sequences are commonly, but not always, found in the 5' 
non-coding regions of genes. Sequence elements within 
promoters that function in the initiation of transcription are 

35 often characterized by consensus nucleotide sequences. These 
promoter elements include RNA polymerase binding sites; TATA 
sequences; CAAT sequences; differentiation-specific elements 
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(DSEs; McGehee et al . , Mol . Endocrinol . 7:551-560, 1993); 
cyclic AMP response elements (CREs) ; serum response elements 
(SREs; Treisman, Seminars in Cancer Biol. 1:47-58, 1990); 
glucocorticoid response elements (GREs) ; and binding sites for 
5 other transcription factors, such as CRE/ATF {O'Reilly et al , , 
J. Biol. Chem. 267:19938-19943, 1992), AP2 (Ye et al . , 
Biol. Chem. 269:25728-25734, 1994), SPl, cAMP response element 
binding protein (CREB; Loeken, Gene Expr. 3:253-264, 1993) and 
octamer factors. See, in general, Watson et al . , eds . , 

10 Molecular Biology of the Gene , 4th ed. , The Benjamin/Cummings 
Publishing Company, Inc., Menlo Park, CA, 1987; and Lemaigre 
and Rousseau, Biochem. J. 303:1-14, 1994. 

A "repressing carbon source" is a metabolizable , 
carbon- containing compound that, when not limited, suppresses 

15 the expression in an organism of genes required for the 

catablism of other carbon sources. By "limited" is meant that 
the carbon source is unavailable or becomes available at such 
a rate that it is immediately consumed and therefore the 
prevailing concentration of that carbon source in an 

20 organism's environment is effectively zero. Repressing carbon 
sources that can be used within the present invention include 
hexoses and ethanol . Glucose is particularly preferred. 

"Rich" culture media are those culture media that 
are based on complex sources of nutrients, typically cell or 

25 tissue extracts or protein hydrolysates . Rich media will vary- 
in composition from batch to batch due to variations in the 
composition of the nutrient sources. 

As noted above, the present invention provides DNA 
constructs, cells containing the DNA constructs, DNA molecules 

3 0 comprising transcription promoters, and methods for producing 
heterologous polypeptides in the methylotrophic yeast Pichia. 
methanolica.. Those skilled in the art will recognize that 
transformation of cells with heterologous DNA is a 
prerequisite to a large number of diverse biological 

35 applications. Cells so transformed can be used for the 
production of polypeptides and proteins, including 
polypeptides and proteins of higher organisms, including 
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humans. The present invention further provides for the 
transformation of Pichia methanolica. cells with other DNA 
molecules, including DNA libraries and synthetic DNA 
molecules. The invention thus provides techniques that can be 
5 used to express genetically diverse libraries to produce 

products that are screened for novel biological activities, to 
engineer cells for use as targets for the screening of 
compound libraries, and to genetically modify cells to enhance 
their utility within other processes. 

10 Strains of Pichia methanolica are available from 

the American Type Culture Collection (Rockville, MD) and other 
repositories. Within one embodiment of the invention, cells 
to be transformed with heterologous DNA will have a mutation 
that can be complemented by a gene (a "selectable marker") on 

15 the heterologous DNA molecule. This selectable marker allows 
the transformed cells to grow under conditions in which 
untransf ormed cells cannot multiply ("selective conditions") . 
The general principles of selection are well known in the art. 
Commonly used selectable markers are genes that encode enzymes 

20 required for the synthesis of amino acids or nucleotides. 

Cells having mutations in these genes cannot grow in media 
lacking the specific amino acid or nucleotide unless the 
mutation is complemented by the selectable marker. Use of 
such "selective" culture media ensures the stable maintenance 

25 of the heterologous DNA within the host cell. A preferred 

selectable marker of this type for use in Pichia methanolica 
is a P. methanolica ADE2 gene, which encodes 
phosphor ibosyl - 5 - aminoimidazole carboxylase ( AIRC ; EC 
4.1.1.21) . The ADE2 gene, when transformed into an ade2 host 

30 cell, allows the cell to grow in the absence of adenine. The 
coding strand of a representative P. methanolica ADE2 gene 
sequence is shown in SEQ ID NO:l. The sequence illustrated 
includes 1006 nucleotides of 5' non-coding sequence and 442 
nucleotides of 3' non-coding sequence, with the initiation ATG 

35 codon at nucleotides 1007-1009. Within a preferred embodiment 
of the invention, a DNA segment comprising nucleotides 
407-2851 is used as a selectable marker, although longer or 
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shorter segments could be used as long as the coding portion 
is operably linked to promoter and terminator sequences. 
Those skilled in the art will recognize that this and other 
sequences provided herein represent single alleles of the 
5 respective genes, and that allelic variation is expected to 
exist. Any functional ADE2 allele can be used within the 
present invention. Other nutritional markers that can be used 
within the present invention include the P. methanolica ADEl, 
HISS, and LEU2 genes, which allow for selection in the absence 

10 of adenine, histidine, and leucine, respectively. 

Heterologous genes, such as genes from other fungi, can also 
be used as selectable markers. For large-scale, industrial 
processes where it is desirable to minimize the use of 
methanol, it is preferred to use host cells in which both 

15 methanol utilization genes {AUGl and AUG2) are deleted. For 
production of secreted proteins, host cells deficient in 
vacuolar protease genes {PEP4 and PRBl) are preferred. 
Gene-deficient mutants can be prepared by known methods, such 
as site-directed mutagenesis. P. wethanolica genes can be 

2 0 cloned on the basis of homology with their counterpart 

Saccharomyces cerevisiae genes. The ADE2 gene disclosed 
herein was given its designation on the basis of such 
homology. 

To prepare auxotrophic mutants of P. wethanolica, 
25 cells are first exposed to mutagenizing conditions, i.e. 

environmental conditions that cause genetic mutations in the 
cells. Methods for mutagenizing cells are well known in the 
art and include chemical treatment, exposure to ultraviolet 
light, exposure to x-rays, and retroviral insert ional 

3 0 mutagenesis. Chemical mutagens include ethylmethane sulfonate 

(EMS) , W-methyl-iST' -nitro-2\^-nitrosoguanidine , 

2 -methoxy- 6 - chloro- 9 - [ 3 - (ethyl - 2 - chloroe thyl ) aminopropylamino] 
acridine'2HCl, 5-bromouracil , acridine, and aflatoxin. See 
Lawrence, Methods Enzvmol . 194:273-281, 1991. The proportion 
35 of mutagenized cells obtained is a function of the strength or 
amount of mutagenizing agent to which the cells are exposed. 
A low level of mutagen produces a small proportion of mutant 
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cells. Higher levels of mutagen produce a higher proportion 
of mutant cells, but also kill more cells. It is therefore 
necessary to balance mutagenesis with killing so that a 
reasonable number of mutant cells is obtained. Balancing is 
5 generally done empirically by exposing cells to different 
conditions to establish a killing curve. In general, the 
cells are exposed to mutagenizing conditions and cultured for 
one day, after which they are tested for viability according 
to standard assay methods. Within the present invention, it 

10 is preferred to use a level of mutagenesis that results in 
20-50% mortality, although one skilled in the art will 
recognize that this value can be adjusted as necessary, for 
example if working with a very large number of cells. 

Mutagenized cells are then cultured in a rich 

15 medium to allow mutations to become established and replicated 
in at least a portion of the cell population. This step 
allows cells in which the genome has been altered to replicate 
the mutation and pass it on to their progeny, thereby 
establishing the mutation within the population. 

20 The cells are then transferred to a culture medium 

deficient in assimilable nitrogen so that cellular nitrogen 
stores are depleted. By "deficient in assimilable nitrogen" 
it is meant that the medium lacks an amount of nitrogen 
sufficient to support growth of the cells. Depletion of 

25 cellular nitrogen stores will generally require about 12 to 24 
hours of incubation, with 16 hours being sufficient under 
common conditions. Following depletion of nitrogen stores, the 
cells are cultured in a defined culture medium comprising an 
inorganic nitrogen source and an amount of an antifungal 

30 antibiotic sufficient to kill growing P. methanollca, cells. 
The antibiotic nystatin (mycostatin) is particularly 
preferred. Preferred inorganic nitrogen sources are those 
comprising ammonium ions, such as ammonium sulfate. In 
general, the medium will contain 10-2 00 mM ammonium, 

3 5 preferably about 6 0 mM ammonium. Nystatin is included at a 
concentration of 0.1 to 100 mg/1, preferably 0.5 to 20 mg/L, 
more preferably about 2 mg/L (10 units/L) . Treatment with 
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antibiotic is carried out for ten minutes to six hours, 
preferably about 1 hour. Those skilled in the art will 
recognize that the actual antibiotic concentration and 
exposure time required to kill prototrophic cells can be 
5 readily determined empirically, and certain adjustments may be 
necessary to compensate for variations in specific activity 
between individual batches of antibiotic. By depleting 
cellular nitrogen stores and then culturing the cells in a 
defined medium containing an inorganic nitrogen source and an 

10 antibiotic, cells that are auxotrophic for amino acid or 

nucleotide biosynthesis remain alive because they cannot grow 
in the defined medium. Growing cells are killed by the 
antibiotic. Following the antibiotic treatment, the cells are 
transferred to a rich culture medium. 

15 Auxotrophic mutations are confirmed and 

characterized by determining the nutrient requirements of the 
treated cells. Replica plating is commonly used for this 
determination. Cells are plated on both rich medium and media 
lacking specific nutrients. Cells that do not grow on 

20 particularly plates are auxotrophic for the missing nutrient. 
Complementation analysis can be used for further 
characterization . 

Within another embodiment of the invention, a 
dominant selectable marker is used, thereby obviating the need 

25 for mutant host cells. Dominant selectable markers are those 
that are able to provide a growth advantage to wild- type 
cells. Typical dominant selectable markers are genes that 
provide resistance to antibiotics, such as neomycin-type 
antibiotics (e.g., G418) , hygromycin B, and 

30 bleomycin/phleomycin-type antibiotics (e.g., Zeocin™; 

available from Invitrogen Corporation, San Diego, CA) • A 
preferred dominant selectable marker for use in P. me tha.no lica 
is the Sh hla gene, which inhibits the activity of Zeocin™. 
Heterologous DNA can be introduced into P. 

35 methanolica cells by any of several known methods, including 
lithium transformation (Hiep et al . , Yeast 9:1189-1197, 1993; 
Tarutina and Tolstorukov, Abst . of the 15th International 
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Specialized Symposium on Yeasts , Riga (USSR), 1991, 137; Ito 
et al., J. Bacteriol. 153:163, 1983; Bogdanova et al . , Yeast 
11:343, 1995), spheroplast transformation (Beggs, Nature 
275:104, 1978; Hinnen et al . , Proc . Natl, Acad . Sci . USA 
5 75:1929, 1978; Cregg et al . , Mol . Cell. Biol. 5:3376, 1985), 
freeze -thaw polyethylene glycol transformation (Pichia 
Expression Kit Instruction Manual, Invitrogen Corp., San 
Diego, CA, Cat, No. K1710-01) , or electroporation, the latter 
method being preferred, Electroporation is the process of 

10 using a pulsed electric field to transiently permeabilize cell 
membranes, allowing macromolecules , such as DNA, to pass into 
cells. Electroporation has been described for use with 
mammalian (e.g., Neumann et al . , EMBO J. 1:841-845, 1982) and 
fungal (e.g., Meilhoc et al . , Bio/Technolocr/ 8:223-227, 1990) 

15 host cells. However, the actual mechanism by which DNA is 
transferred into the cells is not well understood. For 
transformation of P. me t ha.no lica, it has been found that 
electroporation is surprisingly efficient when the cells are 
exposed to an exponentially decaying, pulsed electric field 

20 having a field strength of from 2.5 to 4,5 kV/cm and a time 

constant (t) of from 1 to 4 0 milliseconds. The time constant 
t is defined as the time required for the initial peak voltage 
Vq to drop to a value of Vo/e. The time constant can be 
calculated as the product of the total resistance and 

25 capacitance of the pulse circuit, i.e., t = R x C. Typically, 
resistance and capacitance are either preset or may be 
selected by the user, depending on the electroporation 
equipment selected. In any event, the equipment is configured 
in accordance with the manufacturer's instructions to provide 

30 field strength and decay parameters as disclosed above. 
Electroporation equipment is available from commercial 
suppliers (e.g., BioRad Laboratories, Hercules, CA) . 

DNA molecules and constructs for use in 
transforming P. methanolica will commonly be prepared as 

35 double -stranded, circular plasmids, which are preferably 
linearized prior to transformation. For polypeptide or 
protein production, the DNA constructs will include, in 
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addition to the selectable marker disclosed above, an 
expression cassette comprising a transcription promoter, a DNA 
segment (e.g., a cDNA) encoding the polypeptide or protein of 
interest, and a transcription terminator. These elements are 
5 operably linked to provide for transcription of the DNA 

segment of interest. It is preferred that the promoter and 
terminator be that of a P. methanolica gene. Useful promoters 
include those from constitutive and methanol -inducible 
promoters . Promoter sequences are generally contained with 

10 1.5 kb upstream of the coding sequence of a gene, often within 
1 kb or less. In general, regulated promoters are larger than 
constitutive promoters due the presence of regulatory 
elements. Methanol -inducible promoters, which include both 
positive and negative regulatory elements, may extend more 

15 than 1 kb upstream from the initiation ATG. Promoters are 
identified by function and can be cloned according to known 
methods . 

A particularly preferred methanol -inducible 
promoter is that of a P. jT?e thanolica alcohol utilization gene. 

2 0 A representative coding strand sequence of one such gene, 

AUGl, is shown in SEQ ID NO : 2 . Within SEQ ID NO : 2 , the 
initiation ATG codon is at nucleotides 1355-1357. Nucleotides 
1-23 of SEQ ID NO: 2 are non-AC7Gl polylinker sequence. It is 
particularly preferred to utilize as a promoter a segment 
25 comprising nucleotides 24-1354 of SEQ ID NO:2, although 

additional upstream sequence can be included. P. itiBthanolica 
contains a second alcohol utilization gene, AUG2, the promoter 
of which can be used within the present invention. A partial 
DNA sequence of one AUG2 clone is shown in SEQ ID NO: 9. AUG2 

3 0 promoter segments used within the present invention will 

generally comprise nucleotides 91-169 of SEQ ID NO: 9, although 
small truncations at the 3' end would not be expected to 
negate promoter function. Other useful promoters include 
those of the dihydroxyacetone synthase (DHAS) , formate 
35 dehydrogenase (FMD) , and catalase (CAT) genes. Genes encoding 
these enzymes from other species have been described, and 
their sequences are available (e.g., Janowicz et al . , Nuc . 
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Acids Res . 13:2043, 1985; Hollenberg and Janowicz, EPO 
publication 0 299 108; Didion and Roggenkamp, FEBS Lett , 
303:113, 1992). Genes encoding these proteins can be cloned 
by using the known sequences as probes, or by aligning known 
5 sequences, designing primers based on the alignment, and 

amplifying P. methanol ica DNA by the polymerase chain reaction 
(PGR) . 

Constitutive promoters are those that are not 
activated or inactivated by environmental conditions; they are 

10 always transcriptionally active. Preferred constitutive 

promoters for use within the present invention include those 
from glyceraldehyde- 3 -phosphate dehydrogenase, triose 
phosphate isomerase, and phosphoglycerate kinase genes of P. 
methanolica. These genes can be cloned by complementation in 

15 a host cell, such as a Saccharomyces cerevisiae cell, having a 
mutation in the counterpart gene. Mutants of this type are 
well known in the art. See, for example, Kawasaki and 
Fraenkel, Biochem. Biophvs . Res. Gomm. 108:1107-1112, 1982; 
McKnight et al . , Cell 46:143-147, 1986; Aguilera and 

20 Zimmermann, Mol . Gen . Genet . 202:83-89, 1986. 

The DNA constructs of the present invention will 
further include a selectable marker to allow for 
identification, selection, and maintenance of transf ormants . 
The DNA constructs may further contain additional elements, 

25 such an origin of replication and a selectable marker that 

allow amplification and maintenance of the DNA in an alternate 
host (e.g., E. coli) . To facilitate integration of the DNA 
into the host chromosome, it is preferred to have the entire 
expression segment, comprising the promoter- -gene of 

30 interest- -terminator plus selectable marker, flanked at both 

ends by host DNA sequences. This is conveniently accomplished 
by including 3' untranslated DNA sequence at the downstream 
end of the expression segment and relying on the promoter 
sequence at the 5' end. When using linear DNA, the expression 

3 5 segment will be flanked by cleavage sites to allow for 

linearization of the molecule and separation of the expression 
segment from other sequences (e.g., a bacterial origin of 
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replication and selectable marker) . Preferred cleavage sites 
are those that are recognized by restriction endonucleases 
that cut infrequently within a DNA sequence, such as those 
that recognize 8-base target sequences (e.g.. Not I). 
5 Proteins that can be produced in P. methanolica 

using the methods of the present invention include proteins of 
industrial and pharmaceutical interest. Such proteins include 
higher eukaryotic proteins from plants and animals, 
particularly vertebrate animals such as mammals, although 

10 certain proteins from microorganisms are also of great value. 
Proteins that can be prepared using methods of the present 
invention include enzymes such as lipases, cellulases, and 
proteases; enzyme inhibitors, including protease inhibitors; 
growth factors such as platelet derived growth factor, 

15 fibroblast growth factors, and epidermal growth factor; 

cytokines such as erythropoietin and thrombopoietin; and 
hormones such as insulin, leptin, and glucagon. 

For use within the present invention, P. 
methanolica cells are cultured in a medium comprising adequate 

2 0 sources of carbon, nitrogen and trace nutrients at a 

temperature of about 25°C to 35°C. Liquid cultures are 
provided with sufficient aeration by conventional means, such 
as shaking of small flasks or sparging of f ermentors . A 
preferred culture medium is YEPD (Table 1) . The cells may be 

25 passaged by dilution into fresh culture medium or stored for 
short periods on plates under refrigeration. For long-term 
storage, the cells are preferably kept in a 50% glycerol 
solution at -70°C. 
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APE D 



15 
Table 1 

2% D-glucose 

2% Bacto^ Peptone (Difco Laboratories, Detroit, MI) 
1% Bacto™ yeast extract (Difco Laboratories) 
0.004% adenine 
0.006% L-leucine 



0.056% -Ade -Trp -Thr powder 
10 0.67% yeast nitrogen base without amino acids 

2% D-glucose 

0.5% 200X tryptophan, threonine solution 

ADE DS 

0.056% -Ade -Trp -Thr powder 
15 0.67% yeast nitrogen base without amino acids 

2% D-glucose 

0.5% 200X tryptophan, threonine solution 
18.22% D-sorbitol 

LEU D 

20 0.052% -Leu -Trp -Thr powder 

0.67% yeast nitrogen base without amino acids 
2% D-glucose 

0.5% 200X tryptophan, threonine solution 

HIS D 

25 0.052% -His -Trp -Thr powder 

0.67% yeast nitrogen base without amino acids 
2% D-glucose 

0.5% 200X tryptophan, threonine solution 

URA D 

30 0.056% -Ura -Trp -Thr powder 

0.67% yeast nitrogen base without amino acids 
2% D-glucose 

0.5% 200X tryptophan, threonine solution 
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Table 1. continued 
URA PS 

0.056% -Ura -Trp -Thr powder 

0.67% yeast nitrogen base without amino acids 
5 2% D-glucose 

0.5% 200X tryptophan, threonine solution 
18.22% D-sorbitol 
-Leu -Trp -Thr powder 

powder made by combining 4.0 g adenine , 3 . 0 g 

10 arginine, 5.0 g aspartic acid, 2.0 g histidine, 6.0 

g isoleucine, 4.0 g lysine, 2.0 g methionine, 6.0 g 
phenylalanine, 5.0 g serine, 5.0 g tyrosine, 4.0 g 
uracil, and 6.0 g valine (all L- amino acids) 
-His -Trp -Thr powder 

15 powder made by combining 4.0 g adenine, 3.0 g 

arginine, 5.0 g aspartic acid, 6.0 g isoleucine, 
8.0 g leucine, 4.0 g lysine, 2.0 g methionine, 6.0 
g phenylalanine, 5.0 g serine, 5.0 g tyrosine, 4.0 
g uracil, and 6.0 g valine (all L- amino acids) 

2 0 -Ura -Trp -Thr powder 

powder made by combining 4.0 g adenine , 3 . 0 g 
arginine, 5,0 g aspartic acid, 2.0 g histidine, 6.0 
g isoleucine, 8.0 g leucine, 4.0 g lysine, 2.0 g 
methionine, 6.0 g phenylalanine, 5.0 g serine, 5.0 
25 g tyrosine, and 6.0 g valine (all L- amino acids) 

-Ade -Trp -Thr powder 

powder made by combining 3.0 g arginine, 5.0 g 
aspartic acid, 2.0 g histidine, 6.0 g isoleucine, 
8.0 g leucine, 4.0 g lysine, 2.0 g methionine, 6.0 

3 0 g phenylalanine, 5.0 g serine, 5.0 g tyrosine, 4.0 

g uracil, and 6,0 g valine (all L- amino acids) 
200X tryptophan, threonine solution 

3.0% L- threonine, 0.8% L- tryptophan in H2O 
For plates, add 1.8% Bacto"" agar (Difco 
3 5 Laboratories) 
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Electroporation of P. methanolica is preferably 
carried out on cells in early log phase growth. Cells are 
streaked to single colonies on solid media, preferably solid 
YEPD. After about 2 days of growth at 3 0°C, single colonies 
5 from a fresh plate are used to inoculate the desired volume of 
rich culture media (e.g., YEPD) to a cell density of about 5 - 
10 X 10^ cells/ml. Cells are incubated at about 25 - 35^C, 
preferably 30 °C, with vigorous shaking, until they are in 
early log phase. The cells are then harvested, such as by 

10 centrifugation at 3000 x g for 2-3 minutes, and resuspended. 

Cells are made electrocompetent by reducing disulfide bonds in 
the cell walls, equilibrating them in an ionic solution that 
is compatible with the electroporation conditions, and 
chilling them. Cells are typically made electrocompetent by 

15 incubating them in a buffered solution at pH 6-8 containing a 
reducing agent, such as dithiothreitol (DTT) or 
-mercaptoethanol (BME) , to reduce cell wall proteins to 
facilitate subsequent uptake of DNA. A preferred incubation 
buffer in this regard is a fresh solution of 50 mM potassium 

20 phosphate buffer, pH 7.5, containing 25 mM DTT. The cells are 
incubated in this buffer (typically using one-fifth the 
original culture volume) at about 3 0°C for about 5 to 3 0 
minutes, preferably about 15 minutes. The cells are then 
harvested and washed in a suitable electroporation buffer, 

25 which is used ice-cold. Suitable buffers in this regard 

include pH 6-8 solutions containing a weak buffer, divalent 
cations (e.g., Mg*"", Ca**) and an osmotic stabilizer (e.g., a 
sugar) . After washing, the cells are resuspended in a small 
volume of the buffer, at which time they are electrocompetent 

3 0 and can be used directly or aliquotted and stored frozen 

(preferably at -70°C) . A preferred electroporation buffer is 
STM (270 mM sucrose, 10 mM Tris, pH 7 . 5 , 1 mM MgCla) . Within 
a preferred protocol, the cells are subjected to two washes, 
first in the original culture volume of ice-cold buffer, then 

35 in one-half the original volume. Following the second wash, 

the cells are harvested and resuspended, typically using about 
3-5 ml of buffer for an original culture volume of 2 00 ml. 
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Elect roporat ion is carried out using a small volume 
of electrocompetent cells (typically about 100 /xl) and up to 
one-tenth volume of linear DNA molecules. For example, 0 . 1 ml 
of cell suspension in a buffer not exceeding 50 mM in ionic 
5 strength is combined with 0.1-10 /ig of DNA (vol. 10 ^1) . This 
mixture is placed in an ice-cold electroporation cuvette and 
subjected to a pulsed electric field of from 2.5 to 4.5 kV/cm, 
preferably about 3.75 kV/cm, and a time constant of from 1 to 
40 milliseconds, preferably 10-30 milliseconds, more 

10 preferably 15-25 milliseconds, most preferably about 20 

milliseconds, with exponential decay. The actual equipment 
settings used to achieve the desired pulse parameters will be 
determined by the equipment used. When using a BioRad 
(Hercules, CA) Gene Pulser™ electroporator with a 2 mm 

15 electroporation cuvette, resistance is set at 600 ohms or 

greater, preferably "infinite" resistance, and capacitance is 
set at 25 /xF to obtain the desired field characteristics. 
After being pulsed, the cells are diluted approximately lOX 
into 1 ml of YEPD broth and incubated at 3 0«>C for one hour. 

2 0 The cells are then harvested and plated on 

selective media. Within a preferred embodiment, the cells are 
washed once with a small volume (equal to the diluted volume 
of the electroporated cells) of IX yeast nitrogen base (6.7 
g/L yeast nitrogen base without amino acids; Difco 
25 Laboratories, Detroit, MI), and plated on minimal selective 
media. Cells having an ade2 mutation that have been 
transformed with an ADE2 selectable marker can be plated on a 
minimal medium that lacks adenine, such as ADE D (Table 1) or 
ADE DS (Table 1) , In a typical procedure, 250 fil aliqouts of 

3 0 cells are plated on 4 separate ADE D or ADE DS plates to 

select for Ade" cells. 

P. methanolica recognizes certain infrequently 
occurring sequences, termed autonomously replicating sequences 
(ARS) , as origins of DNA replication, and these sequences may 
3 5 fortuitously occur within a DNA molecule used for 

transformation, allowing the transforming DNA to be maintained 
extrachromosomally . However, integrative transf ormants are 
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generally preferred for use in protein production systems. 
Integrative transf ormants have a profound growth advantage 
over ARS transf ormants on selective media containing sorbitol 
as a carbon source, thereby providing a method for selecting 
5 integrative transf ormants from among a population of 

transformed cells . ARS sequences have been found to exist in 
the ADE2 gene and, possibly, the AUGl gene of P. methanolica. 
ade2 host cells of Pichia methanolica transformed with an ADE2 
gene can thus become Ade'' by at least two different modes. 

10 The ARS within the ADE2 gene allows unstable extrachromosomal 
maintenance of the transforming DNA (Hiep et al . , Yeast 
9:1189-1197, 1993). Colonies of such transf ormants are 
characterized by slower growth rates and pink color due to 
prolific generation of progeny that are Ade". Transforming 

15 DNA can also integrate into the host genome, giving rise to 
stable transf ormants that grow rapidly, are white, and that 
fail to give rise to detectable numbers of Ade" progeny. ADE 
D plates allow the most rapid growth of transformed cells, and 
unstable and stable transf ormants grow at roughly the same 

20 rates. After 3-5 days of incubation on ADE D plates at 30^C 
stable transformant colonies are white and roughly twice the 
size of unstable, pink transf ormants . ADE DS plates are more 
selective for stable transf ormants , which form large (-5 mm) 
colonies in 5-7 days, while unstable (ARS-maintained) colonies 

25 are much smaller (--1 mm) . The more selective ADE DS media is 
therefore preferred for the identification and selection of 
stable transf ormants . For some applications, such as the 
screening of genetically diverse libraries for rare 
combinations of genetic elements, it is sometimes desirable to 

3 0 screen large numbers of unstable transf ormants , which have 

been observed to outnumber stable transf ormants by a factor of 
roughly 100. In such cases, those skilled in the art will 
recognize the utility of plating transformant cells on less 
selective media, such as ADE D. 

3 5 Integrative transf ormants are preferred for use in 

protein production processes. Such cells can be propagated 
without continuous selective pressure because DNA is rarely 
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lost from the genome. Integration of DNA into the host 
chromosome can be confirmed by Southern blot analysis. 
Briefly, transformed and untransf ormed host DNA is digested 
with restriction endonucleases , separated by electrophoresis, 
5 blotted to a support membrane, and probed with appropriate 

host DNA segments. Differences in the patterns of fragments 
seen in untransf ormed and transformed cells are indicative of 
integrative transformation. Restriction enzymes and probes 
can be selected to identify transforming DNA segments (e.g., 

10 promoter, terminator, heterologous DNA, and selectable marker 
sequences) from among the genomic fragments. 

Differences in expression levels of heterologous 
proteins can result from such factors as the site of 
integration and copy number of the expression cassette and 

15 differences in promoter activity among individual isolates. 
It is therefore advantageous to screen a number of isolates 
for expression level prior to selecting a production strain. 
A variety of suitable screening methods are available. For 
example, transformant colonies are grown on plates that are 

20 overlayed with membranes (e.g., nitrocellulose) that bind 

protein. Proteins are released from the cells by secretion or 
following lysis, and bind to the membrane. Bound protein can 
then be assayed using known methods, including immunoassays. 
More accurate analysis of expression levels can be obtained by 

2 5 culturing cells in liquid media and analyzing conditioned 

media or cell lysates, as appropriate. Methods for 
concentrating and purifying proteins from media and lysates 
will be determined in part by the protein of interest. Such 
methods are readily selected and practiced by the skilled 
30 practitioner. 

For small-scale protein production (e.g., plate or 
shake flask production) , P. methanalica transf ormants that 
carry an expression cassette comprising a methanol -regulated 
promoter (such as the AUGl promoter) are grown in the presence 

3 5 of methanol and the absence of interfering amounts of other 

carbon sources (e.g., glucose). For small-scale experiments, 
including preliminary screening of expression levels, 
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transf ormants may be grovm at 3 0 °C on solid media containing, 
for example, 20 g/L Bacto-agar (Difco) , 6.7 g/L yeast nitrogen 
base without amino acids (Difco), 10 g/L methanol, 0.4 ^g/L 
biotin, and 0.56 g/L of -Ade -Thr -Trp powder. Because 
5 methanol is a volatile carbon source it is readily lost on 

prolonged incubation. A continuous supply of methanol can be 
provided by placing a solution of 50% methanol in water in the 
lids of inverted plates, whereby the methanol is transferred 
to the growing cells by evaporative transfer. In general, not 

10 more than 1 mL of methanol is used per 100-mm plate. Slightly 
larger scale experiments can be carried out using cultures 
grown in shake flasks. In a typical procedure, cells are 
cultivated for two days on minimal methanol plates as 
disclosed above at 3 0 °C, then colonies are used to inoculate a 

15 small volume of minimal methanol media (6.7 g/L yeast nitrogen 
base without amino acids, 10 g/L methanol, 0.4 ^9/^ biotin) at 
a cell density of about 1 x 10^ cells/ml. Cells are grown at 
30*^C. Cells growing on methanol have a high oxygen 
requirement, necessitating vigorous shaking during 

20 cultivation. Methanol is replenished daily (typically 1/100 
volume of 5 0% methanol per day) . 

For production scale culturing, fresh cultures of 
high producer clones are prepared in shake flasks. The 
resulting cultures are then used to inoculate culture medium 

25 in a fermenter. Typically, a 500 ml culture in YEPD grown at 
30^C for 1-2 days with vigorous agititation is used to 
inoculate a 5- liter fermenter. The cells are grown in a 
suitable medium containing salts, glucose, biotin, and trace 
elements at 28 °C, pH 5.0, and >3 0% dissolved O2 . After the 

3 0 initial charge of glucose is consumed (as indicated by a 

decrease in oxygen consumption) , a glucose/methanol feed is 
delivered into the vessel to induce production of the protein 
of interest. Because large-scale fermentation is carried out 
under conditions of limiting carbon, the presence of glucose 

3 5 in the feed does not repress the methanol -inducible promoter. 
The use of glucose in combination with methanol under 
glucose- limited conditions produces rapid growth, efficient 
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conversion of carbon to biomass and rapid changes in 
physiological growth states, while still providing full 
induction of methanol- inducible gene promoters. In a typical 
fermentation run, a cell density of from about 80 to about 4 00 
5 grams of wet cell paste per liter is obtained. "Wet cell 

paste" refers to the mass of cells obtained by harvesting the 
cells from the fermentor, typically by centrif ugation of the 
culture . 

10 The invention is further illustrated by the 

following non-limiting examples. 



Examples 



15 Example 1 

P. methanolica cells (strain CBS6515 from American 
Type Culture Collection, Rockville, MD) were mutagenized by UV 
exposure. A killing curve was first generated by plating 
cells onto several plates at approximately 200-250 

20 cells/plate. The plates were then exposed to UV radiation 

using a G8T5 germicidal lamp (Sylvania) suspended 25 cm from 
the surfaces of the plates for periods of time as shown in 
Table 2 . The plates were then protected from visible light 
sources and incubated at 30 °C for two days, 

25 

Table 2 
Viable Cells 

Time Plate 1 Plate 2 Average 

0 sec. 225 229 227 

30 1 sec. 200 247 223 

2 sec. 176 185 181 

4 sec, 149 86 118 

8 sec, 20 7 14 

16 sec. 0 2 1 



35 
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Large-scale mutagenesis was then carried out using 
a 2 -second UV exposure to provide about 20% killing. Cells 
were plated at approximately 10^ cells/plate onto eight YEPD 
plates that were supplemented with 100 mg/L each of uracil, 
5 adenine, and leucine, which were added to supplement the 
growth of potential auxotrophs having the cognate 
deficiencies. Following UV exposure the plates were wrapped 
in foil and incubated overnight at 30 ^C. The following day 
the colonies on the plates (-10^ total) were resuspended in 

10 water and washed once with water. An amount of cell 

suspension sufficient to give an ODgoo of 0-1 " was used to 

inoculate 5 00 ml of minimal broth made with yeast nitrogen 
base without amino acids or ammonia, supplemented with 1% 
glucose and 400 /xg/L biotin. The culture was placed in a 2 . 8 

15 L baffled Bell flask and shaken vigorously overnight at 3 0^C. 

The following day the cells had reached an ODgoo of -1.0 - 2.0. 
The cells were pelleted and resuspended in 500 ml of minimal 
broth supplemented with 5 g/L ammonium sulfate. The cell 
suspension was placed in a 2 . 8 L baffled Bell flask and shaken 

2 0 vigorously at 30 °C for 6 hours. 5 0 ml of the culture was set 
aside in a 250 -ml flask as a control, and to the remainder of 
the culture was added 1 mg nystatin (Sigma Chemical Co., St. 
Louis, MO) to select for auxotrophic mutants (Snow, Nature 
211:206-207, 1966). The cultures were incubated with shaking 

25 for an additional hour. The control and nystatin- treated 

cells were then harvested by centrif ugation and washed with 
water three times. The washed cells were resuspended to an 
ODgoo of 1-0 i^i 50% glycerol and frozen. Titering of 
nystatin- treated cells versus the control cells for colony 

30 forming units revealed that nystatin enrichment had decreased 
the number of viable cells by a factor of 10^. 

10"^ dilutions of nystatin- treated cells were plated 
on 15 YEPD plates. Colonies were replica-plated onto minimal 
plates (2% agar, 1 x YNB, 2% glucose, 400 ^xg/L biotin) . The 

35 frequency of auxotrophs was about 2-4%. Approximately 180 
auxotrophic colonies were picked to YEPD + Ade, Leu, Ura 
plates and replica-plated to various dropout plates. All of 
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the auxotrophs were Ade" . Of these, 3 0 were noticably pink on 
dropout plates (LEU HIS D, etc.; see Table 1) . Of the 30 
pink mutants, 21 were chosen for further study; the remainder 
were either leaky for growth on ADE D plates or contaminated 
5 with wild-type cells. 

The Ade' mutants were then subjected to 
complementation analysis and phenotypic testing. To determine 
the number of loci defined by the mutants, all 21 mutants were 
mated to a single pink, Ade" tester strain (strain #2) . 

10 Mating was carried out by mixing cell suspensions (ODgoo = D 
and plating the mixtures in 10 /xl aliquots on YEPD plates. 
The cells were then replicated to SPOR media (0.5% Na acetate, 
1% KCl, 1% glucose, 1% agar) and incubated overnight at 3 0*^C. 
The cells were then replica-plated to ADE D plates for scoring 

15 of phenotype . As shown in Table 3, some combinations of 

mutants failed to give Ade^ colonies (possibly defining the 
same genetic locus as in strain #2) , while others gave rise to 
numerous Ade* colonies (possibly defining a separate genetic 
locus) . Because mutant #3 gave Ade" colonies when mated to 

20 #2, complementation testing was repeated with mutant #3. If 
the group of mutants defined two genetic loci, then all 
mutants that failed to give Ade"" colonies when mated to strain 
#2 should give Ade* colonies when mated to #3 . Results of the 
crosses are shown in Table 3 . 



25 
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Mutant 

#1 
#3 

5 #10 

#15 
#18 
#24 
#28 

10 #30 

#2 
#6 
#8 
#9 

15 #11 

#17 
#19 
#20 
#22 

20 #27 

#4 
#12 
#16 



25 

As shown in Table 3, most mutants fell into one of 
two groups, consistent with the idea that there are two 
adenine biosynthetic genes that, when missing, result in pink 
30 colonies on limiting adenine media. Three colonies (#4, #12, 
and #16) may either define a third locus or exhibit intragenic 
complementation. Two intensely pigmented mutants from each of 
the two complementation groups (#3 and #10; #6 and #11) were 
selected for further characterization. Additional analysis 

3 5 indicated that Ade' was the only auxotrophy present in these 

strains . 

A P. metha^nolica. clone bank was constructed in the 
vector pRS426, a shuttle vector comprising 2fJL and S, 
cerevisiae URA3 sequences, allowing it to be propagated in S. 
40 cerevisiae. Genomic DNA was prepared from strain CBS6515 
according to standard procedures. Briefly, cells were 
cultured overnight in rich media, spheroplasted with 
zymolyase, and lysed with SDS . DNA was precipitated from the 
lysate with ethanol and extracted with a phenol/chloroform 

4 5 mixture, then precipitated with ammonium acetate and ethanol. 



25 
Table 3 

X Mutant #2 x Mutant *3 

+ 
+ 

+ 



+ 
+ 
+ 



+ + 
+ + 
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Gel electrophoresis of the DNA preparation showed the presence 
of intact, high molecular weight DNA and appreciable 
quantities of RNA. The DNA was partially digested with Sau 3A 
by incubating the DNA in the presence of a dilution series of 
5 the enzyme. Samples of the digests were analyzed by 
electrophoresis to determine the size distribution of 
fragments, DNA migrating between 4 and 12 kb was cut from the 
gel and extracted from the gel slice. The size- fractionated 
DNA was then ligated to pRS426 that had been digested with Bam 

10 HI and treated with alkaline phosphatase. Aliquots of the 

reaction mixture were electroporated in E. coli MC1061 cells 
using a BioRad Gene Pulser™ device as recommended by the 
manufacturer . 

The genomic library was used to transform 5. 

15 cerevisiae strain HBy21A (ade2 ura3) by electroporation 

(Becker and Guarente, Methods Enzvmol. 194:182-187, 1991). 
The cells were resuspended in 1.2 M sorbitol, and six 300 /il 
aliquots were plated onto ADE D, ADE DS , URA D and URA DS 
plates (Table 1) . Plates were incubated at 3 0**C for 4-5 days. 

20 No Ade"" colonies were recovered on the ADE D or ADE DS plates. 
Colonies from the URA D and URA DS plates were replica-plated 
to ADE D plates, and two closely spaced, white colonies were 
obtained. These colonies were restreaked and confirmed to be 
Ura* and Ade^ . These two strains, designated Adel and Ade6 , 

25 were streaked onto media containing 5 FOA (5 fluoro orotic 

acid; Sikorski and Boeke, Methods Enzvmol . 194:302-318). Ura" 
colonies were obtained, which were found to be Ade* upon 
replica plating. These results indicate that the Ade* 
complementing activity is genetically linked to the 

30 plasmid-borne URA3 marker. Plasmids obtained from yeast 

strains Adel and Ade6 appeared to be identical by restriction 
mapping as described below. These genomic clones were 
designated pADEl-1 and pADEl-6, respectively. 

Total DNA was isolated from the HBY21A 

35 transf ormants Adel and Ade6 and used to transform E. coli 

strain MC1061 to Amp**. DNA was prepared from 2 Amp*" colonies 
of Adel and 3 Amp*" colonies of Ade6 . The DNA was digested with 
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Pst I, Sea I, and Pst I + Sea I and analyzed by gel 
electrophoresis. All five isolates produced the same 
restriction pattern. 

PGR primers were designed from the published 
5 sequence of the P. wethanolica. ADE2 gene (also known as ADEl; 
Hiep et al . , Yeast 9:1251-1258, 1993). Primer 9080 (SEQ ID 
NO: 3) was designed to prime at bases 406-429 of the ADE2 DNA 
(SEQ ID NO:l), and primer 9079 (SEQ ID NO:4) was designed to 
prime at bases 2852-2829. Both primers included tails to 
10 introduce Avr II and Spe I sites at each end of the amplified 
sequence. The predicted size of the resulting PGR fragment 
was 245 0 bp. 

PGR was carried out using plasmid DNA from the five 
putative ADE2 clones as template DNA. The 100 ^1 reaction 

15 mixtures contained Ix Tag PGR buffer (Boehringer Mannheim, 
Indianapolis, IN), 10-100 ng of plasmid DNA, 0.25 mM dNTPs, 
100 pmol of each primer, and 1 fil Taq polymerase (Boehringer 
Mannheim) . PGR was run for 3 0 cycles of 3 0 seconds at 94 °G, 
60 seconds at 50<'G, and 120 seconds at 72°G. Each of the five 

20 putative ADE2 genomic clones yielded a PGR product of the 
expected size (2.4 kb) . Restriction mapping of the DNA 
fragment from one reaction gave the expected size fragments 
when digested with Bgl II or Sal I. 

The positive PGR reactions were pooled and digested 

25 with Spe I. Vector pRS426 was digested with Spe I and treated 
with calf intestinal phosphatase. Four /xl of PGR fragment and 
1 fil of vector DNA were combined in a 10 /xl reaction mix using 
conventional ligation conditions. The ligated DNA was 
analyzed by gel electrophoresis. Spe I digests were analyzed 

3 0 to identify plasmids carrying a subclone of the ADE2 gene 

within pRS426. The correct plasmid was designated pGZRllS . 

Because the ADE2 gene in pGZRllS had been amplified 
by PGR, it was possible that mutations that disabled the 
functional character of the gene could have been generated. 

35 To test for such mutations, subclones with the desired insert 
were transformed singly into Saccharomyces cerevisiae strain 
HBY21A. Gells were made electrocompetent and transformed 
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according to standard procedures. Trans formants were plated 
on URA D and ADE D plates. Three phenotypic groups were 
identified. Clones 1, 2, 11, and 12 gave robust growth of 
many trans formants on ADE D. The transformation frequency was 
5 comparable to the frequency of Ura^ transf ormants . Clones 6, 

8, 10, and 14 also gave a high efficiency of transformation to 
both Ura* and Ade*, but the Ade"^ colonies were somewhat smaller 
than those in the first group. Clone 3 gave many Ura* 
colonies, but no Ade^ colonies, suggesting it carried a 
10 non- functional ade2 mutation. Clones 1, 2, 11, and 12 were 
pooled . 

To identify the P. methanolica ade2 complementation 
group, two representative mutants from each complementation 
group (#3 and #10; #6 and #11), which were selected on the 

15 basis of deep red pigmentation when grown on limiting adenine, 
were transformed with the cloned ADE gene. Two hundred ml 
cultures of early log phase cells were haorvested by 
centrif ugation at 3 0 00 x g for 3 minutes and resuspended in 20 
ml of fresh KD buffer (50 mM potassium phosphate buffer, pH 

2 0 7.5, containing 25 mM DTT) . The cells were incubated in this 
buffer at 30*^C for 15 minutes. The cells were then harvested 
and resuspended in 200 ml of ice-cold STM (270 mM sucrose, 10 
mM Tris, pH 7.5, 1 mM MgClj) . The cells were harvested and 
resuspended in 100 ml of ice-cold STM. The cells were again 

25 harvested and resuspended in 3-5 ml of ice-cold STM. 100 ^1 
aliquouts of electrocompetent cells from each culture were 
then mixed with Not I -digested pADEl-1 DNA. The cell/DNA 
mixture was placed in a 2 mm electroporation cuvette and 
subjected to a pulsed electric field of 5 kv/cm using a BioRad 

30 Gene Pulser™ set to 1000 Q resistance and capacitance of 25 

/iF. After being pulsed, the cells were diluted by addition of 
1 ml YEPD and incubated at 3 0°C for one hour. The cells were 
then harvested by gentle centrif ugation and resuspended in 400 
^1 minimal selective media lacking adenine (ADE D) . The 

35 resuspended samples were split into 200 ^1 aliqouts and plated 
onto ADE D and ADE DS plates. Plates were incubated at 30^C 
for 4-5 days. Mutants #6 and #11 gave Ade* transf ormants . No 
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Ade* transf ormants were observed when DNA was omitted, hence 
the two isolates appeared to define the ade2 complementation 
group. The ADE2 sequence is shown in SEQ ID NO:l, 

5 Example 2 

The P. methanol ica clone bank disclosed in Example 
1 was used as a source for cloning the Alcohol Utilization 
Gene (AUGl) . The clone bank was stored as independent pools, 
each representing about 200-250 individual genomic clones. 

10 0.1 ^1 of "miniprep" DNA from each pool was used as a template 
in a polymerase chain reaction with PGR primers {8784, SEQ ID 
NO: 5; 8787, SEQ ID NO: 6) that were designed from an alignment 
of conserved sequences in alcohol oxidase genes from Hansenula 
polymorpha, Candida boidini, and Pichia pastoris. The 

15 amplification reaction was run for 30 cycles of 94 °C, 30 

seconds; 50°C, 30 seconds; 72*^0, 60 seconds; followed by a 7 
minute incubation at 72 °C. One pool (#5) gave a -600 bp band. 
DNA sequencing of this PGR product revealed that it encoded an 
amino acid sequence with --70% sequence identity with the 

20 Pichia pas tor is alcohol oxidase encoded by the AOXl gene and 
about 85% sequence identity with the Hansenula polymorpha 
alcohol oxidase encoded by the MOXl gene. The sequence of the 
cloned AUGl gene is shown in SEQ ID NO : 2 . 

Sub-pools of pool #5 were analyzed by PGR using the 

25 same primers used in the initial amplification. One positive 
sub-pool was further broken down to identify a positive 
colony. This positive colony was streaked on plates, and DNA 
was prepared from individual colonies. Three colonies gave 
identical patterns after digestion with Gla I. 

3 0 Restriction mapping of the genomic clone and PGR 

product revealed that the AUGl gene lay on a 7 . 5 kb genomic 
insert and that sites within the PGR fragment could be 
uniquely identified within the genomic insert. Because the 
orientation of the gene within the PGR fragment was known, the 

35 latter information provided the approximate location and 

direction of transcription of the AUGl gene within the genomic 
insert. DNA sequencing within this region revealed a gene 
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with very high sequence similarity at the amino acid level to 
other known alcohol oxidase genes . 



Example 3 

5 ade2 mutant P. methanolica cells are transformed by 

electroporation essentially as disclosed above with an 
expression vector comprising the AUGl promoter and terminator, 
human GAD65 DNA (Karlsen et al . , Proc . Natl, Acad, Sci . USA 
88:8337-8341, 1991), and ADE2 selectable marker. Colonies are 

10 patched to agar minimal methanol plates (10 to 100 colonies 
per 100-mm plate) containing 20 g/L Bacto™-agar (Dif co) , 6.7 
g/L yeast nitrogen base without amino acids (Difco) , 10 g/L 
methanol, and 0.4 /xg/L biotin. The agar is overlayed with 
nitrocellulose, and the plates are inverted over lids 

15 containing 1 ml of 50% methanol in water and incubated for 3 
to 5 days at 30 °C. The membrane is then transferred to a 
filter soaked in 0 . 2 M NaOH, 0.1% SDS, 35 mM dithiothreitol to 
lyse the adhered cells. After 3 0 minutes, cell debris is 
rinsed from the filter with distilled water, and the filter is 

20 neutralized by rinsing it for 30 minutes in 0.1 M acetic acid. 

The filters are then assayed for adhered protein. 
Unoccupied binding sites are blocked by rinsing in TTBS-NFM 
(20 mM Tris pH 7.4, 0,1% Tween 20, 160 mM NaCl , 5% powdered 
nonfat milk) for 30 minutes at room temperature. The filters 

25 are then transferred to a solution containing GAD6 monoclonal 
antibody (Chang and Gottlieb, J. Neurosci . 8:2123-2130, 1988), 
diluted 1:1000 in TTBS-NFM. The filters are incubated in the 
antibody solution with gentle agitation for at least one hour, 
then washed with TTBS (20 mM Tris pH 7 . 4 , 0.1% Tween 20, 160 

3 0 mM NaCl) two times for five minutes each. The filters are 
then incubated in goat ant i -mouse antibody conjugated to 
horseradish peroxidase (1 fig /ml in TTBS-NFM) for at least one 
hour, then washed three times, 5 minutes per wash with TTBS. 
The filters are then exposed to commercially available 

35 chemi luminescence reagents (ECL™; Amersham Inc., Arlington 
Heights, ID . Light generated from positive patches is 
detected on X-ray film. 
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To more accurately detect the level of GADgg 
expression, candidate clones are cultured in shake flask 
cultures . Colonies are grown for two days on rainimal methanol 
plates at 30 °C as disclosed above. The colonies are used to 
5 inoculate 20 ml of minimal methanol media (6.7 g/L yeast 

nitrogen base without amino acids, 10 g/L methanol, 0.4 /xg/L 
biotin) at a cell density of 1 x 10^ cells/ml. The cultures 
are grown for 1-2 days at 30°C with vigorous shaking. 0 . 2 ml 
of 50% methanol is added to each culture daily. Cells are 

10 harvested by centrif ugation and suspended in ice-cold lysis 

buffer (20 mM Tris pH 8.0, 40 mM NaCl, 2 mM PMSF, 1 mM EDTA, 
1 ^g/ml leupeptin, 1 fig/ml pepstatin, 1 /ig/ml aprotinin) at 10 
ml final volume per 1 g cell paste. 2.5 ml of the resulting 
suspension is added to 2 . 5 ml of 400-600 micron, ice-cold, 

15 acid- washed glass beads in a 15 -ml vessel, and the mixture is 
vigorously agitated for one minute, then incubated on ice for 
1 minute. The procedure is repeated until the cells have been 
agitated for a total of five minutes. Large debris and 
unbroken cells are removed by centrif ugation at 1000 x g for 5 

20 minutes. The clarified lysate is then decanted to a clean 

container. The cleared lysate is diluted in sample buffer (5% 
SDS, 8 M urea, 100 mM Tris pH 6.8, 10% glycerol, 2 mM EDTA, 
0.01% bromphenol glue) and electrophoresed on a 4-20% 
acrylamide gradient gel (Novex, San Diego, CA) . Proteins are 

25 blotted to nitrocellose and detected with GAD6 antibody as 
disclosed above . 

Clones exhibiting the highest levels of 
methanol -induced expression of foreign protein in shake flask 
culture are more extensively analyzed under high cell density 

30 fermentation conditions. Cells are first cultivated in 0.5 
liter of YEPD broth at 30°C for 1-2 days with vigorous 
agitation, then used to inoculate a 5 -liter fermentation 
apparatus (e.g., BioFlow III; New Brunswick Scientific Co., 
Inc., Edison, NJ) . The fermentation vessel is first charged 

3 5 with mineral salts by the addition of 57.8 g (^4)3804, 68 g 
KH2PO4, 30.8 g MgS04-7H20, 8.6 g CaS04- 2H2O, 2.0 g NaCl, and 10 
ml antifoam (PPG) . H2O is added to bring the volume to 2.5 L, 
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and the solution is autoclaved 40 minutes. After cooling, 350 
ml of 50% glucose, 250 ml 10 X trace elements (Table 4) , 25 ml 
of 200 /xg/ml biotin, and 250 ml cell inoculum are added. 

5 Table 4 

10 X trace elements: 



FeS04- 7H2O 


lOOmM 


27.8 g/L 


CuS04- 5H2O 


2mM 


0.5 g/L 


ZnCls 


8mM 


1.09 g/L 


MnSO^- H2O 


8mM 


1.35 g/L 


CoCls- 6H2O 


2mM 


0.48 g/L 


NajMoO^ • 2H2O 


ImM 


0.24 g/L 


H3BO3 


8mM 


0.5 g/L 


KI 


0 . 5mM 


0.08 g/L 


biotin 




5mg/L 


thiamine 




0.5 g/L 


mis H2SO4 per 


liter to bring compounds 


into solution. 



20 The fermentation vessel is set to run at 28*=*C, pH 

5.0, and >30% dissolved O2 - The cells will consume the 
initial charge of glucose, as indicated by a sharp demand for 
oxygen during glucose consumption followed by a decrease in 
oxygen consumption after glucose is exhausted. After 

2 5 exhaustion of the initial glucose charge, a glucose-methanol 

feed supplemented with NH4* and trace elements is delivered 
into the vessel at 0.2% (w/v) glucose, 0.2% (w/v) methanol for 
5 hours followed by 0.1% (w/v) glucose, 0.4% (w/v) methanol 
for 25 hours. A total of 550 grams of methanol is supplied 

3 0 through one port of the vessel as pure methanol using an 

initial delivery rate of 12.5 ml/hr and a final rate of 25 
ml/hr. Glucose is supplied through a second port using a 700 
ml solution containing 175 grams glucose, 250 ml lOX trace 
elements, and 99 g (NH4)2S04. Under these conditions the 
3 5 glucose and methanol are simultaneously utilized, with the 
induction of GADgg expression upon commencement of the 
glucose-methanol feed. Cells from the fermentation vessel are 
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analyzed for GADgs expression as described above for shake 
flask cultures. 

Cells are removed from the fermentation vessel at 
certain time intervals and subsequently analyzed. Little GADgs 
5 expression is observed during growth on glucose. Exhaustion 
of glucose leads to low level expression of the GAD^s protein; 
expression is enhanced by the addition of MeOH during feeding 
of the fermentation culture. The addition of methanol has a 
clear stimulatory effect of the expression of human GADgg 
10 driven by the methanol -responsive AUGl promoter. 

Example 4 

Transformation conditions were investigated to 
determine the electric field conditions, DNA topology, and DNA 
15 concentration that were optimal for efficient transformation 
of P. methcLnolica. All experiments used P. methanolica ade2 
strain #11. Competent cells were prepared as previously 
described. Electroporation was carried out using a BioRad 
Gene Pulser^. 

20 Three field parameters influence transformation 

efficiency by electroporation: capacitance, field strength, 
and pulse duration. Field strength is determined by the 
voltage of the electric pulse, while the pulse duration is 
determined by the resistance setting of the instrument. 

25 Within this set of experiments, a matrix of field strength 
settings at various resistances was examined. In all 
experiments, the highest capacitance setting (25 ^F) of the 
instrument was used. 100 fil aliquots of electrocompetent 
cells were mixed on ice with 10 ^il of DNA that contained 

3 0 approximately 1 fig of the ADE2 plasmid pC2R133 that had been 
linearized with the restriction enzyme Not I. Cells and DNA 
were transferred to 2 mm electroporation cuvettes (BTX Corp., 
San Diego, CA) and electropulsed at field strengths of 0.5 kV 
(2.5 kV/cm) , 0.75 kV (3.75 kV/cm) , 1.0 kV (5.0 kV/cm) , 1.25 kV 

35 (6.25 kV/cm) , and 1.5 kV (7.5 kV/cm) . These field strength 
conditions were examined at various pulse durations. Pulse 
duration was manipulated by varying the instrument setting 
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resistances to 200 ohms, 600 ohms, or "infinite" ohms. Pulsed 
cells were suspended in YEPD and incubated at 30«*C for one 
hour, harvested, resuspended, and plated. Three separate sets 
of experiments were conducted. In each set, electroporation 
5 conditions of 0.75 kV {3.75 kV/cm) at a resistance of 

"infinite" ohms was found to give a dramatically higher 
transformation efficiency than other conditions tested (see 
Fig. 1) • 

After the optimal pulse conditions were 

10 established, the influence of DNA topology on transformation 
efficiency was investigated. Electrocompetent cells were 
mixed with 1 fig of uncut, circular pCZR13 3 or with 1 /ig of Not 
I-digested pCZR133 . In three separate experiments, an average 
of roughly 2 5 transf ormants were recovered with circular DNA 

15 while linear DNA yielded an average of nearly 1 x 10* 

transf ormants . These data indicate that linear DNA transforms 
P. methanolica. with much greater efficiency than circular DNA, 
Finally, the relationship between DNA concentration 
and transformation efficiency was investigated. Aliquots of 

20 linear pCZR133 DNA (1 ng, 10 ng, 100 ng and 1 fxg in 10 fil K^O) 
were mixed with 100 ^il electrocompetent cells, and 
electroporation was carried out at 3.75 kV/cm and "infinite" 
ohms. The number of transf ormants varied from about 10 (1 ng 
DNA) to 10^ (1 MS DNA) and was found to be proportional to the 

25 DNA concentration. 

Example 5 

Integration of transforming DNA into the genome of 
P. methanolica. was detected by comparison of DNA from 

3 0 wild- type cells and stable, white transf ormant colonies. Two 
classes of integrative transf ormants were identified. In the 
first, transforming DNA was found to have integrated into a 
homologous site. In the second class, transforming DNA was 
found to have replaced the endogenous AUGl open reading frame. 

35 While not wishing to be bound by theory, this second 

transf ormant is believed to have arisen by a " transplacement 
recombination event" (Rothstein, Methods Enzvmol . 194:281-301, 
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1991) whereby the transforming DNA replaces the endogenous DNA 
via a double recombination event. 

P, methanolica ade2 strain #11 was transformed to 
Ade"^ with Asp I -digested pCZR14 0, a Bluescript® (Stratagene 
5 Cloning Systems, La Jolla, CA) -based vector containing the P. 
methanol ica ADE2 gene and a mutant of AUGl in which the entire 
open reading frame between the promoter and terminator regions 
has been deleted (Fig. 2) . Genomic DNA was prepared from 
wild-type and transformant cells grown for two days on YEPD 

10 plates at 30^C. About 100-200 fxl of cells was suspended in 1 
ml H2O, then centrifuged in a microcentrifuge for 30 seconds. 
The cell pellet was recovered and resuspended in 4 00 /xl of SCE 
+ DTT + zymolyase (1.2 M sorbitol, 10 mM Na citrate, 10 mM 
EDTA, 10 mM DTT, 1-2 mg/ml zymolyase lOOT) and incubated at 

15 37^C for 10-15 minutes. 400 ^,1 of 1% SDS was added, and the 
solution was mixed until clear. 3 00 /il of 5 M potassium 
acetate, pH 8 . 9 was added, and the solution was mixed and 
centrifuged at top speed in a microcentrifuge for five 
minutes. 750 fil of the supernatant was transferred to a new 

20 tube and extracted with an equal volume of phenol/chlorof oirm. 

600 fil of the resulting supernatant was recovered, and DNA was 
precipitated by the addition of 2 volumes of ethanol and 
centrifugation for 15 minutes in the cold. The DNA pellet was 
resuspended in 50 ml TE {10 mM Tris pH 8 , 1 mM EDTA) + 100 

25 ^g/ml RNAase for about 1 hour at 65*^0. 10 ^1 DNA samples were 
digested with Eco RI (5 /xl) in a 100 ^1 reaction volume at 
37*=*C overnight. DNA was precipitated with ethanol, recovered 
by centrifugation, and resuspended in 7.5 ^1 TE + 2.5 ^1 5X 
loading dye. The entire 10 ^ll volume was applied to one lane 

30 of a 0.7% agarose in 0 . 5 X TBE (10 X TBE is 108 g/L Tris base 
7-9, 55 g/L boric acid, 8.3 g/L disodium EDTA) gel. The gel 
was run at 100 V in 0.5 X TBE containing ethidium bromide. 
The gel was photographed, and DNA was elect rophoretically 
transferred to a positively derivatized nylon membrane 

35 (Nytran® N+, Schleicher & Schuell, Keene, NH) at 400 mA, 20 mV 
for 3 0 minutes. The membrane was then rinsed in 2 X SSC, 
blotted onto denaturation solution for five minutes. 
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neutralized in 2 X SSC, then cross -linked damp in a UV 
crosslinker (Stratalinker® , Stratagene Cloning Systems) on 
automatic setting. The blot was hybridized to a PCR-generated 
AUGl promoter probe using a commercially available kit (ECL"* 
5 kit, Amersham Corp., Arlington Heights, XL). Results 

indicated that the transforming DNA altered the structure of 
the AUGl promoter DNA, consistent with a homologous 
integration event (Fig. 2) . 

In a second experiment, P. methanolica ade 2 strain 

10 #11 was transformed to Ade* with Not I -digested pCZR137, a 

vector containing a human GAD65 cDNA between the AUGl promoter 
and terminator (Fig. 3) . Genomic DNA was prepared as 
described above from wild- type cells and a stable, white, Ade* 
transf ormant and digested with Eco RI . The digested DNA was 

15 separated by electrophoresis and blotted to a membrane. The 
blot was probed with a PCR-generated probe corresponding to 
either the AUGl open reading frame or the AUGl promoter. 
Results demonstrated that the AUGl open reading frame DNA was 
absent from the transf ormant strain, and that the AUGl 

20 promoter region had undergone a significant rearrangement. 

These results are consistent with a double recombination event 
(transplacement ) between the transforming DNA and the host 
genome (Fig. 3) . 

25 Example 6 

An AUGl strain of P. methanolica is grown in 
high-density fermentation conditions. The fermentation vessel 
is charged with mineral salts by the addition of 57,8 g 
(NH4)2S04, 46.6 g KCl, 30.8 g MgS04- VH^O, 8.6 g CaS04- 2H2O, 2.0 g 

3 0 NaCl, and 10 ml antifoam (PPG) . H2O is added to bring the 
volume to 2.5 L, and the solution is autoclaved 40 minutes. 
After cooling, 350 ml of 50% glucose, 250 ml 10 X trace 
elements (Table 4), 210 ml of 30% NaPhosphate, 25 ml 200 fig/ml 
biotin, and 250 ml cell inoculum are added. Cells are 

3 5 batch- fed glucose or glucose/methanol in three phases. In 
phase 1, the cells receive 0.4%/L/hour glucose (w/v final 
fermentation volume) for 25 hours using 750 g glucose, 110 g 
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{NH4)2S04, and 278 ml 10 X trace elements per 1.5 liter. The 
cells are then given a transition feed of 0.2% glucose, 0.2% 
methanol/L/hour for 5 hours. The final glucose -supplemented 
methanol feed contains 0.1% glucose, 0.4% methanol/L/hr for 25 
5 hours. Final biomass is about 300 g/L cell paste. 

Example 7 

For fermentation of a P. methanol ica. auglA strain, 
the fermentation vessel is initially charged with mineral 
salts, glucose, phosphate, trace elements and biotin as 
disclosed in Example 6. 250 ml of cell inoculum is added. A 
glucose feed is prepared using 600 g glucose, 108 g (NH4)2S04, 
and 273 ml 10 X trace elements per 1.2 liter. The cells are 
batch-fed in three phases. In the first phase, the cells 
receive glucose for 12 to 25 hours at 0.4%/L/hour. The cells 
are then induced with a bolus addition of 1% methanol by- 
weight and transitioned to methanol utilization with a mixed 
0.2% glucose/0.1% methanol feed for 10 hours. In the third 
phase, a mixed feed of 0.2% glucose, 0.2% methanol is 
delivered for 15 hours. 

Example 8 

P. wethanolica cells in which the AUGl gene had 
been disrupted by insertion of a GAD65 expression construct 
25 retained the ability to grow on methanol, indicating that a 
second alcohol oxidase gene was present. The second gene, 
designated AUG2, was identified by PGR. Sequence analysis of 
the 5' coding region of the gene showed that the N- terminus of 
the encoded protein was similar to those of known alcohol 
3 0 oxidase genes. 

Strain MC GADS, a transformant that grew very 
poorly on minimal methanol broth, was used as a source for 
cloning the AUG2 gene . Genomic DNA was prepared from MC GADS 
and amplified with sense and antisense PGR primers specific 
35 for the AUGl open reading frame (8784, SEQ ID NO: 5; 8787, SEQ 
ID NO: 6) . A product identical in size to the AUGl product but 
showing very low intensity on an analytical gel was obtained. 



15 
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The putative AUG2 PGR product was digested with a 
battery of restriction enzymes. Partial digestion by Eco RI 
and Pvu I, and the presence of several Bgl II sites suggested 
that the DNA was contaminated with small amounts of AUGl , To 
5 remove the contaminating AUGl DNA, the PGR mixture was cut 

with Eco RI and gel purified. Since the MG GAD 8 product did 
not appear to have an Eco RI site, it was unaffected. The 
resulting gel-purified DNA was reamplified and again analyzed 
by restriction digestion. The DNA gave a different 

10 restriction map from that of the AUGl PGR product. 

Southern blot analysis was performed on genomic DNA 
from MG GAD8 and wild- type cells using either AUGl or AUG2 
open reading frame PGR fragments as probes. The AUG2 probe 
hybridized at low stringency to the AUGl locus and at both low 

15 and high stringency to a second locus . The AUGl probe bound 

to both loci at low stringency, but bound predominantly to the 
AUGl locus at high stringency. These data indicated that the 
new PGR product from MG GAD8 was similar to but distinct from 
AUGl. Sequence analysis showed an 83% identity between AUGl 

2 0 and AUG2 gene products. 

To clone the AUG2 genomic locus, PGR primers were 
designed from the original AUG2 PGR fragment. Primers 9885 
(SEQ ID NO: 7) and 9883 (SEQ ID NO: 8) were used to screen a P. 
methanolica. genomic library. A positive clone bank pool was 
25 then probed with the original MG GADS PGR product. Gells were 
plated on 10 plates at about 5000 colonies/plate and grown 
overnight, then the plates were overlayed with filter discs 
(Hybond-N, Amersham Gorp., Arlington Heights, ID . Golonies 
were denatured, neutralized, and UV cross- linked. Bacterial 

3 0 debris was washed from the filters with 5X SSG, and the 

filters were again cross- linked. Blots were pre-hybridized in 
pairs at 42 °G for 1 hour in 25 ml hybridization buffer. 
Approximately 25 0 ng of probe was then added to each pair of 
filters. Hybridization was conducted at 42*=*G for four hours. 
3 5 The blots were then washed in 50 0 ml of 0*1 X SSG, 6M urea, 
0.4% SDS at 42 °G for 10 minutes, four times. The blots were 
then neutralized with 500 ml of 2 X SSG at room temperature 
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for 5 minutes, two rinses. The blots were then immersed in 
100 ml development reagent (ECL, Amersham Corp.). 

Positive colonies were picked and amplified using 
PGR primers 9885 (SEQ ID NO:7) and 9883 (SEQ ID NO:8) to 
5 confirm their identity. Positive pools were streaked on 
plates, and single colonies were rescreened by PGR. One 
colony was selected for further analysis (restriction mapping 
and sequencing) . A partial sequence of the AUG2 gene is shown 
in SEQ ID NO: 9. As shown in SEQ ID NO: 9, the AUG2 sequence 

10 begins at the Hindlll site a nucleotide 91. Nucleotides 

upstream from this position are vector sequence. The coding 
sequence begins at nucleotide 170. 

Disruption of the AUG2 gene had little effect on 
cell growth on methanol. Cells lacking both functional AUGl 

15 and AUG2 gene products did not grow on methanol. Subsequent 
analysis showed that the AUGl gene product is the only 
detectable alcohol oxidase in cells grown in a fermentor. 

From the foregoing, it will be appreciated that, 
2 0 although specific embodiments of the invention have been 
described herein for purposes of illustration, various 
modifications may be made without deviating from the spirit 
and scope of the invention. Accordingly, the invention is not 
limited except as by the appended claims. 



25 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: ZymoGenetics 

(B) STREET: 1201 Eastlake Avenue E. 

(C) CITY: Seattle 

(D) STATE: Washington 

(E) COUNTRY: United States of America 

(F) POSTAL CODE (ZIP) : 98102 

(G) TELEPHONE: 

(H) TELEFAX: 

(I) TELEX: 

(ii) TITLE OF INVENTION: COMPOSITIONS AND METHODS FOR PRODUCING 
HETEROLOGOUS POLYPEPTIDES IN PICHIA METHANOLICA 

(iii) NUMBER OF SEQUENCES: 9 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patent In Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 08-NOV-1996 

(C) CLASSIFICATION: 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/006,397 

(B) FILING DATE: 09-NOV-1995 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/703,807 

(B) FILING DATE: 26 -AUG- 1996 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/703,809 

(B) FILING DATE: 26 -AUG- 1996 

(vii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Parmelee, Steven W. 

(B) REGISTRATION NUMBER: 31,990 

(C) REFERENCE/DOCKET NUMBER: 13952-46PC 

(viii) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (206) 467-9600 

(B) TELEFAX: (415) 576-0300 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3077 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doilble 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
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(xi) SEQUENCE DESCRIPTION: 
CAGCTGCTCT GCTCCTTGAT TCGTAATTAA 
TCCCCAACAG GGATTCCAAT CGGTGCTCAG 
TATTGATGCT GCAAAAACTT TTTTAGCCGG 
CTGTGGGCGT TCCACACTCC TTGCTTTTCA 
TGATTCTCTT ATTACCAGTT ATGTAGAAAG 
GAACGCTGAC CCACGGTTTC AAATAACTAT 
TGCTTGCTTA AAGCGGCTAA AAAGTGTTTG 
CTCCTGTAAA GGGCCGATTC GACTTCGAAA 
AAAATTGCTA AAGGAGTACT AGGGCTGTAG 
AAAAGAATGA CGCTGTATGT CGTAGCCTGC 
GCAAATCTGT TGGTCACCGG TTCGATCCGG 
TTTGCGGGTA GGAAGCAAGG TCTAGTTTTC 
CCATGAGTGT TTCCCTCTGG CTACCTAATA 
TTCTTTCCAA GTTCGGCTTT CAGCTCGTAA 
TTTCAGAGTC AAATTAATTT TCGCTAACAA 
TAACTGATAA GTCGAATCAA CATCTTTAAA 
TATTAACCAA TAGCATATTC ACAGGCATCA 
TGTCGGGATT TTAGGTGGTG GCCAACTTGG 
GAATATCAAA ACTGTGATTC TCGAAAATGG 
TTTAGATGAC CATATTGACG GCTCATTCAA 
CAAGTGTGAT GTTTTAACCG TTGAGATTGA 
TCAAAAGGCA ACTGGCATCA AAATCTTCCC 
TAAATACTTG CAAAAAGAGC ATTTGATTAA 
TGTTGAAAGT AGCGCAGCAT CTTTAGAAGA 
GCTAAAATCT AGAACAATGG CCTATGACGG 
ATATATACCT GAAGCTTTGA AAGTTTTAGA 
TCCATTTTCA AAGGAGTTAG CTGTTATGGT 
CTACCCAACT GTTGAAACCA TCCACCAAAA 
TAGAGTTAAC GATACTGTCC AAAAGAAGGC 
TTTCCCAGGT GCTGGTATCT TTGGTGTTGA 
AGTCAACGAA ATTGCCCCAA GACCTCACAA 
CACCTCGCAA TTTGAAGCTC ATGTTAGGGC 
CACTTGTTTG TCGACTCCAT CTACCCAAGC 



41 

SEQ ID NO:l: 

TGTTATCCTT TTACTTTGAA CTCTTGTCGG 60 

CGGGATTTCC CATGAGGTTT TTGACAACTT 120 

GTTTAAGTAA CTGGGCAATA TTTCCAAAGG 180 

TAATCTCTGT GTATTGTTTT ATTCGCATTT 240 

ATCGGCAAAC AAAATATCAA CTTTTATCTT 300 

CAGAACTCTA TAGCTATAGG GGAAGTTTAC 360 

GCAAATTAAA AAAGCTGTGA CAAGTAGGAA 42 0 

GAGCCTAAAA ACAGTGACTA TTGGTGACGG 480 

TAATAAATAA TGGAACAGTG GTACAACAAT 54 0 

ACGAGTAGCT CAGTGGTAGA GCAGCAGATT 600 

TCTCGGGCTT CCTTTTTTGC TTTTTCGATA 66 0 

GTCGTTTCGG ATGGTTTACG AAAGTATCAG 720 

TATTTATTGA TCGGTCTCTC ATGTGAATGT 780 

ATGTGCAAGA AATATTTGAC TCCAGCGACC 840 

TTTGTGTTTT TCTGGAGAAA CCTAAAGATT 900 

TCCTTTAGTT AAGATCTCTG CAGCGGCCAG 960 

CATCGGAACA TTCAGAATGG ACTCGCAAAC 1020 

TCGTATGATC GTTGAAGCTG CACACAGATT 1080 

AGACCAGGCT CCAGCAAAGC AAATCAACGC 114 0 

TGATCCAAAA GCAATTGCCG AATTGGCTGC 12 00 

ACATGTTGAC ACTGATGCGT TGGTTGAAGT 1260 

ATCACCAGAA ACTATTTCAT TGATCAAAGA 1320 

GAATGGCATT GCTGTTGCCG AATCTTGTAG 13 80 

AGTTGGTGCC AAATACGGCT TCCCATACAT 1440 

AAGAGGTAAT TTTGTTGTCA AAGACAAGTC 1500 

TGACAGGCCG TTATACGCCG AGAAATGGGC 1560 

TGTGAGATCA ATCGATGGCC AAGTTTATTC 162 0 

CAACATCTGT CACACTGTCT TTGCTCCAGC 1680 

CCAAATTTTG GCTGACAACG CTGTCAAATC 174 0 

AATGTTTTTA TTACAAAATG GTGACTTATT 1800 

TTCTGGTCAC TATACCATCG ACGCTTGTGT 1860 

CATTACTGGT CTACCCATGC CGAAGAACTT 1920 

TATTATGTTG AACGTTTTAG GTGGCGATGA 1980 
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GCAAAACGGT 


GAGTTCAAGA 


TGTGTAAAAG 


AGCACTAGAA 


ACTCCTCATG 


CTTCTGTTTA 


2040 


CTTATACGGT 


AAGACTACAA 


GACCAGGCAG 


AAAAATGGGT 


CACATTAATA 


TAGTTTCTCA 


2100 


ATCAATGACT 


GACTGTGAGC 


GTAGATTACA 


TTACATAGAA 


GGTACGACTA 


ACAGCATCCC 


2160 


TCTCGAAGAA 


CAGTACACTA 


CAGATTCCAT 


TCCGGGCACT 


TCAAGCAAGC 


CATTAGTCGG 


2220 


TGTCATCATG 


GGTTCCGATT 


CGGACCTACC 


AGTCATGTCT 


CTAGGTTGTA 


ATATATTGAA 


2280 


GCAATTTAAC 


GTTCCATTTG 


AAGTCACTAT 


CGTTTCCGCT 


CATAGAACCC 


CACAAAGAAT 


2340 


GGCCAAGTAT 


GCCATTGATG 


CTCCAAAGAG 


AGGGTTGAAG 


TGCAT CATTG 


CTGGTGCTGG 


2400 


TGGTGCCGCT 


CATTTACCGG 


GAATGGTTGC 


GGCGATGACG 


CCGCTGCCTG 


TTATTGGTGT 


2460 


CCCTGTTAAA 


GG CTCT ACTT 


TGGATGGTGT 


TGATTCACTA 


CACTCCATCG 


TTCAAATGCC 


2520 


AAGAGGTATT 


CCTGTTGCTA 


CTGTGGCTAT 


TAACAATGCT 


ACTAACGCTG 


CCTTGCTAGC 


2580 


TATCACAATC 


TTAGGTGCCG 


GCGATCCAAA 


TACTTGTCTG 


CAATGGAAGT 


TTATATGAAC 


2640 


AATATGGAAA 


ATGAAGTTTT 


GGGCAAGGCT 


GAAAAATTGG 


AAAATGGTGG 


ATATGAAGAA 


2700 


TACTTGAGTA 


CATACAAGAA 


GTAGAACCTT 


TTATATTTGA 


TATAGTACTT 


ACTCAAAGTC 


2760 


TTAATTGTTC 


TAACTGTTAA 


TTTCTGCTTT 


GCATTTCTGA 


AAAGTTTAAG 


ACAAQAAATC 


2820 


TTGAAATTTC 


TAGTTGCTCG 


TAAGAGGAAA 


CTTGCATTCA 


AATAACATTA 


ACAATAAATG 


2880 


ACAATAATAT 


ATTATTTCAA 


CACTGCTATA 


TGGTAGTTTT 


ATAGGTTTGG 


TTAGGATTTG 


2940 


AGATATTGCT 


AGCGCTTATC 


ATTATCCTTA 


ATTGTTCATC 


GACGCAAATC 


GACGCATTTC 


3000 


CACAAAAATT 


TTCCGAACCT 


GTTTTTCACT 


TCTCCAGATC 


TTGGTTTAGT 


ATAGCTTTTG 


3060 


ACACCTAATA 


CCTGCAG 










3077 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 86 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

GAATTCCTGC AGCCCGGGGG ATCGGGTAGT GGAATGCACG GTTATACCCA CTCCAAATAA 60 

AAGTGTAGTA GCCGGACTGA AAGGTTTTAG GAGTCTGTTT GTTTGTTCAT GTGCATCATT 120 

CCCTAATCTG TTAACAGTCT CGGAGTATAC AAAAAAGTAA GTCAAATATC AAGGTGGCCG 180 

GGGGCAGCAT CGAGACTCGA GATGGTACAT ACTTAAAAGC TGCCATATTG AGGAACTTCA 24 0 

AAGTTTTATC TGTTTTTAGA ATTAAAAGAC GATTGTTGTA ACAAAACGTT GTGCCTACAT 300 
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AAACTCAAAT TAATGGAAAT AGCCTGTTTT GAAAAATACA CCTTCTTAAG TACTGACAAA 360 

GTTTTGTTAA ATGACTATCG AACAAGCCAT GAAATAGCAC ATTTCTGCCA GTCACTTTTA 42 0 

ACACTTTCCT GCTTGCTGGT TGACTCTCCT CATACAAACA CCCAAAAGGG AAACTTTCAG 48 0 

TGTGGGGACA CTTGACATCT CACATGCACC CCAGATTAAT TTCCCCAGAC GATGCGGAGA 54 0 

CAAGACAAAA CAACCCTTTG TCCTGCTCTT TTCTTTCTCA CACCGCGTGG GTGTGTGCGC 6 00 

AGGCAGGCAG GCAGGCAGCG GGCTGCCTGC CATCTCTAAT CGCTGCTCCT CCCCCCTGGC 660 

TTCAAATAAC AGCCTGCTGC TATCTGTGAC CAGATTGGGA CACCCCCCTC CCCTCCGAAT 72 0 

GATCCATCAC CTTTTGTCGT ACTCCGACAA TGATCCTTCC CTGTCATCTT CTGGCAATCA 780 

GCTCCTTCAA TAATTAAATC AAATAAGCAT AAATAGTAAA ATCGCATACA AACGTCATGA 840 

AAAGTTTTAT CTCTATGGCC AACGGATAGT CTATCTGCTT AATTCCATCC ACTTTGGGAA 900 

CCGCTCTCTC TTTACCCCAG ATTCTCAAAG CTAATATCTG CCCCTTGTCT ATTGTCCTTT 96 0 

CTCCGTGTAC AAGCGGAGCT TTTGCCTCCC ATCCTCTTGC TTTGTTTCGG TTATTTTTTT 1020 

TTCTTTTGAA ACTCTTGGTC AAATCAAATC AAACAAAACC AAACCTTCTA TTCCATCAGA 108 0 

TCAACCTTGT TCAACATTCT ATAAATCGAT ATAAATATAA CCTTATCCCT CCCTTGTTTT 114 0 

TTACCAATTA ATC7UVTCTTC AAATTTCAAA TATTTTCTAC TTGCTTTATT ACTCAGTATT 1200 

AACATTTGTT TAAACCAACT ATAACTTTTA ACTGGCTTTA GAAGTTTTAT TTAACATCAG 1260 

TTTCAATTTA CATCTTTATT TATTAACGAA ATCTTTACGA ATTAACTCAA TCAAAACTTT 132 0 

TACGAAAAAA AAATCTTACT ATTAATTTCT CAAAATGGCT ATTCCAGATG AATTTGATAT 1380 

TATTGTTGTC GGTGGTGGTT CCACCGGTTG TGCTCTTGCT GGTAGATTAG GTAACTTGGA 144 0 

CGAAAACGTC ACAGTTGCTT TAATCGAAGG TGGTGAAAAC AACATCAACA ACCCATGGGT 1500 

TTACTTACCA GGTGTTTATC CAAGAT^CAT GAGATTAGAC TCAAAGACTG CTACTTTTTA 1560 

CTCTTCAAGA CCATCACCAC ACTTGAACGG TAGAAGAGCT ATTGTTCCAT GTGCTAACAT 162 0 

CTTGGGTGGT GGTTCTTCCA TCAACTTCTT GATGTACACC AGAGCCTCTG CCTCCGATTA 168 0 

CGATGATTGG GAATCTGAAG GTTGGACTAC CGATGAATTA TTACCACTAA TGAAGAAGAT 174 0 

TGAAACTTAT CAAAGACCAT GTAACAACAG AGAATTGCAC GGTTTCGATG GTCCAATTAA 180 0 

GGTTTCATTT GGTAACTATA CTTATCCAAA CGGTCAAGAT TTCATTAGAG CTGCCGAATC 186 0 

TCAAGGTATT CCATTTGTTG ATGATGCTGA AGATTTGAAA TGTTCCCACG GTGCTGAGCA 192 0 

CTGGTTGAAG TGGATCAACA GAGACTTAGG TAGAAGATCC GATTCTGCTC ATGCTTACAT 198 0 

TCACCCAACC ATGAGAAACA AGCAAAACTT GTTCTTGATT ACTTCCACCA AGTGTGAAAA 204 0 

GATTATCATT GAAAACGGTG TTGCTACTGG TGTTAAGACT GTTCCAATGA AGCCAACTGG 2100 

TTCTCCAAAG ACCCAAGTTG CTAQAACTTT CAAGGCTAGA AAGCAAATTA TTGTTTCTTG 216 0 

TGGTACTATC TCATCACCAT TAGTTTTGCA AAGATCTGGT ATCGGTTCCG CTCACAAGTT 222 0 

GAGACAAGTT GGTATTAAAC CAATTGTTGA CTTACCAGGT GTTGGTATGA ACTTCCAAGA 228 0 

TCACTACTGT TTCTTCACTC CATACCATGT CAAGCCAGAT ACTCCATCAT TCGATGACTT 234 0 
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TGTTAGAGGT 


GATAAAGCTG 


TTCAAAAATC 


TGCTTTCGAC 


CAATGGTATG 


CTAACAAGGA 


2400 


TGGTCCATTA 


ACCACTAATG 


GTATTGAGGC 


AGGTGTTAAG 


ATTAGACCAA 


CTGAAGAAGA 


2460 


ATTAGCCACT 


GCTGATGACG 


AATTCTIGAGC 


TGCTTATGAT 


GACTACTTTG 


GTAACAAGCC 


2520 


AGATAAGCCA 


TTAATGCACT 


ACTCTCTAAT 


TTCTGGTTTC 


TTTGGTGACC 


ACACCAAGAT 


2580 


TCCAAACGGT 


AAGTACATGT 


GCATGTTCCA 


CTTCTTGGAA 


TATCCATTCT 


CCAGAGGTTT 


2640 


CGTTCACGTT 


GTTTCTCCAA 


ACCCATACGA 


TGCTCCTGAC 


TTTGATCCAG 


GTTTCATGAA 


2700 


CGATCCAAGA 


GATATGTGGC 


CAATGGTTTG 


GTCTTACAAG 


AAGTCCAGAG 


AAACTGCCAG 


2760 


AAGAATGGAC 


TGTTTTGCCG 


GTGAAGTTAC 


TTCTCACCAC 


CCACACTACC 


CATACGACTC 


2820 


ACCAGCCAGA 


GCTGCTGACA 


TGGACTTGGA 


AACTACTAAA 


GCTTATGCTG 


GTCCAGACCA 


2880 


CTTTACTGCT 


AACTTGTACC 


ACGGTTCATG 


GACTGTTCCA 


ATTGAAAAGC 


CAACTCCAAA 


2940 


GAACGCTGCT 


CACGTTACTT 


CTAACCAAGT 


TGAATU^CAT 


CGTGACATCG 


AATACACCAA 


3000 


GGAGGATGAT 


GCTGCTATCG 


AAGATTACAT 


CAGAGAACAC 


ACTGAAACCA 


CATGGCATTG 


3060 


TCTTGGTACT 


TGTTCAATGG 


CTCCAAGAGA 


AGGTTCTAAG 


GTTGTCCCAA 


CTGGTGGTGT 


3120 


TGTTGACTCC 


AGATTAAACG 


TTTACGGTGT 


TGAAAAGTTG 


AAGGTTGCTG 


ATTTATCAAT 


3180 


TTGCCCAGAT 


AATGTTGGTT 


GTAACACTTA 


CTCTACTGCT 


TTGTTAATCG 


GTGAAAAGGC 


3240 


TTCTACCTTA 


GTTGCTGAAG 


ACTTGGGCTA 


CTCTGGTGAT 


GCTTTGAAGA 


TGACTGTTCC 


3300 


AAACTTCAAA 


TTGGGTACTT 


ATGAAGAAGC 


TGGTCTAGCT 


AGATTCTAGG 


GCTGCCTGTT 


3360 


TGGATATTTT 


TATAATTTTT 


GAGAGT 








3386 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

TGATCACCTA GGACTAGTGA CAAGTAGGAA CTCCTGTA 3 8 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 



CAGCTGCCTA GGACTAGTTT CCTCTTACGA GCAACTAGA 



39 
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( 2 ) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
TGGTTGAAGT GGATCAA 



(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 
GTGTGGTCAC CGAAGAA 

( 2 ) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZC9885 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 
GTTGTTCCTT CCAAACCATT GAAC 
(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZC98B3 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 
AAAGTAAGAA GCGTAGCCTA GTTG 
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(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 329 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : doiible 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

GACCATGATT ACGCCAAGCG CGCAATTAAC CCTCACTAAA GGGAACAAAA GCTGGGTACC 6 0 

GGGCCCCCCC TCGAGGTCGA CGGTATCGAT AAGCTTTATT ATAACATTAA TATACTATTT 120 

TATAACAGGA TTGAAAATTA TATTTATCTA TCTAAAACTA AAATTCAAAA TGGCTATTCC 180 

TGAAGAATTC GATATCATTG TTGTCGGTGG TGGTTCTGCC GGCTGTCCTA CTGCTGGTAG 240 

ATTGGCTAAC TTAGACCCAA ATTTAACTGT TGCTTTAATC GAAGCTGGTG AAAACAACAT 300 

TAACAACCCA TGGGTCTACT TACCAGGCG 329 
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What is claimed: 



1 1. A DNA construct comprising the following 

2 operably linked elements: 

3 a transcription promoter of a Pichia methanol ica 

4 gene ; 

5 a DNA segment encoding a polypeptide heterologous 

6 to P. methanol ica; 

7 a transcription terminator of a P. methanolica 

8 gene ; and 

9 a selectable marker. 

1 2 . A DNA construct according to claim 1 wherein 

2 said transcription promoter is a promoter of a 

3 methanol -inducible P. methanolica gene. 

1 3 . A DNA construct according to claim 2 wherein 

2 said promoter is a promoter of a gene selected from the group 

3 consisting of P, methanolica alcohol oxidase, dihydroxyacetone 

4 synthase, formate dehydrogenase, and catalase genes. 

1 4 . A DNA construct according to claim 3 wherein 

2 said promoter comprises a sequence of nucleotides as shown in 

3 SEQ ID NO: 2 from nucleotide 24 to nucleotide 1354. 

1 5. A DNA construct according to claim 3 wherein 

2 said promoter comprises a sequence of nucleotides as shown in 

3 SEQ ID NO: 9 from nucleotide 91 to nucleotide 16 9. 

1 6 . A DNA construct according to claim 1 wherein 

2 said selectable marker is a P. methanolica gene. 



1 7, A DNA construct according to claim 6 wherein 

2 said marker is a P. methanolica ADE2 gene. 
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1 8 . A DNA construct according to claim 7 wherein 

2 said ADE2 gene comprises a sequence of nucleotides as shown in 

3 SEQ ID NO:l from nucleotide 407 to nucleotide 2851. 

1 9 . A DNA construct according to claim 1 wherein 

2 said selectable marker is a gene from a fungus other than P. 

3 me tha^nol i ca. . 

1 10. A DNA construct according to claim 9 wherein 

2 said selectable marker is a Saccharomyces cerevisiae gene. 

1 11. A DNA construct according to claim 1 which is 

2 a linear molecule. 

1 12 . A DNA construct according to claim 1 wherein 

2 said selectable marker is a dominant selectable marker, 

1 13 . A P. me tha.no lica. cell containing a DNA 

2 construct according to claim 1, wherein the cell expresses the 

3 DNA segment . 

1 14 . A P. methanolica cell according to claim 13 

2 wherein the DNA construct is integrated into the host cell 

3 genome . 

1 15 . A cell according to claim 13 wherein said 

2 transcription promoter is a promoter of a methanol- inducible 

3 P . me thanol i ca gene . 

1 16. A cell according to claim 15 wherein said 

2 promoter is a promoter of a gene selected from the group 

3 consisting of P. methanolica alcohol oxidase, dihydroxyacetone 

4 synthase, formate dehydrogenase, and catalase genes. 

1 17. A cell according to claim 16 wherein said 

2 promoter comprises a sequence of nucleotides as shown in SEQ 

3 ID NO : 2 from nucleotide 24 to nucleotide 1354. 
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1 18 . A cell according to claim 16 wherein said 

2 promoter comprises a sequence of nucleotides as shown in SEQ 

3 ID NO: 9 from nucleotide 91 to nucleotide 169. 

1 19. A cell according to claim 13 wherein said 

2 selectable marker is a P. methanolica gene. 

1 20. A cell according to claim 19 wherein said 

2 marker is a P. methanolica ADE2 gene. 

1 21. A cell according to claim 20 wherein said ADE2 

2 gene comprises a sequence of nucleotides as shown in SEQ ID 

3 NO:l from nucleotide 407 to nucleotide 2851. 

1 22. A cell according to claim 13 wherein said 

2 selectable marker is a gene from a fungus other than P. 

3 methanolica. 

1 23. A cell according to claim 22 wherein said 

2 selectable marker is a Saccharomyces cerevisiae gene. 

1 24 . A cell according to claim 13 wherein said 

2 selectable marker is a dominant selectable marker. 

1 25, A method for producing a higher eukaryotic 

2 polypeptide in Pichia methanolica comprising: 

3 (a) culturing a P. methanolica cell into which has 

4 been introduced a heterologous DNA construct comprising the 

5 following operably linked elements: 

6 (i) a first DNA segment comprising a transcription 

7 promoter of a methanol- inducible P. methanolica gene; 

8 (ii) a second DNA segment encoding a higher 

9 eukaryotic polypeptide; 

10 (iii) a third DNA segment comprising a P. 

11 methanolica gene transcription terminator; and 

12 (iv) a selectable marker, under conditions in 

13 which the second DNA segment is expressed; and 
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14 (b) recovering the polypeptide encoded by the 

15 second DNA segment. 

1 26. A method according to claim 25 wherein the 

2 transcription promoter is a DNA segment that is contained 

3 within a 1.5 kilobase pair region 5' of and adjacent to a P. 

4 methanol ica structural gene coding sequence. 

1 27. A method according to claim 26 wherein said 

2 structural gene encodes an enzyme selected from the group 

3 consisting of alcohol oxidase, dihydroxyacetone synthase, 

4 formate dehydrogenase, and catalase. 

1 28. A method according to claim 27 wherein the 

2 first DNA segment comprises a sequence of nucleotides as shown 

3 in SEQ ID NO : 2 from nucleotide 24 to nucleotide 1354, 

1 29. A method according to claim 27 wherein the 

2 first DNA segment comprises a sequence of nucleotides as shown 

3 in SEQ ID NO : 9 from nucleotide 91 to nucleotide 169. 

1 30. A method according to claim 25 wherein the 

2 transcription terminator is a sequence that is contained 

3 within a 500 base pair region 3' of and adjacent to a P. 

4 methanolica structural gene coding sequence. 

1 31. A method according to claim 25 wherein the 

2 culturing step is carried out in a liquid medium comprising 

3 methanol and a repressing carbon source . 

1 32. A method according to claim 31 wherein said 

2 repressing carbon source is glucose. 

1 33. A method according to claim 25 wherein the 

2 cells are cultured to a density of from 80 to 4 00 grams of wet 

3 cell paste per liter prior to recovering the polypeptide. 
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1 34. A method according to claim 25 wherein the 

2 selectable marker is a P. wethanolica gene. 

1 35. A method according to claim 34 wherein the 

2 selectable marker is a P. methanolica ADE2 gene. 

1 36. A method according to claim 35 wherein the 

2 ADE2 gene comprises a sequence of nucleotides as shown in SEQ 

3 ID NO:l from nucleotide 407 to nucleotide 2851. 

1 37. A method according to claim 25 wherein the 

2 selectable marker is a gene from a fungus other than P. 

3 me thanol i ca . 

1 38. A method according to claim 37 wherein the 

2 selectable marker is a Saccharomyces cerevisiae gene. 

1 3 9. A method according to claim 25 wherein the 

2 selectable marker is a dominant selectable marker. 

1 40. A method according to claim 25 wherein the 

2 polypeptide is a human polypeptide. 

1 41. A method according to claim 25 wherein said 

2 heterologous DNA molecule is a linear DNA molecule. 

1 42. A method for preparing Pichia methanolica 

2 cells containing a foreign DNA construct comprising: 

3 (a) introducing into P. methanolica cells a DNA 

4 construct comprising the following operably linked elements: 

5 (i) a first DNA segment comprising a 

6 transcription promoter of a methanol -inducible 

7 P. methanolica gene; 

8 (ii) a second DNA segment encoding a higher 

9 eukaryotic polypeptide; 

10 (iii) a third DNA segment comprising a P. 

11 methanolica gene transcription terminator; and 
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12 (iv) a selectable marker; 

13 (b) culturing the P. methanolica cells from step 

14 (a) under conditions that select for growth of cells 

15 containing the DNA construct ; and 

16 (c) recovering cells that grow under the selective 

17 conditions. 

1 43 . A method according to claim 42 wherein said 

2 selectable marker is a P. methanolica, gene. 

1 44 . A method according to claim 43 wherein the 

2 selectable marker is a P. methanolica ADE2 gene, 

1 45. A method according to claim 44 wherein the 

2 ADE2 gene comprises a sequence of nucleotides as shown in SEQ 

3 ID NO:l from nucleotide 407 to nucleotide 2851, 

1 46. A method according to claim 42 wherein said 

2 selectable marker is a dominant selectable marker. 

1 47. A DNA molecule comprising a transcription 

2 promoter of a P- methanol zca gene, wherein said DNA molecule 

3 is essentially free of coding sequence of said gene. 

1 48. A DNA molecule according to claim 47 wherein 

2 said gene is a methanol -inducible gene. 

1 49. A DNA molecule according to claim 48 wherein 

2 said gene is selected from the group consisting of P- 

3 methanolica. alcohol oxidase, dihydroxyacetone synthase, 

4 formate dehydrogenase, and catalase genes. 

1 50. A DNA molecule according to claim 49 wherein 

2 said promoter comprises a sequence of nucleotides as shown in 

3 SEQ ID NO:2 from nucleotide 24 to nucleotide 1354. 
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1 51. A DNA molecule according to claim 49 wherein 

2 said promoter comprises a sequence of nucleotides as shown in 

3 SEQ ID NO: 9 from nucleotide 91 to nucleotide 169. 

1 52 . A DNA molecule comprising a transcription 

2 promoter of a P. methanolica gene, wherein said promoter is 

3 operably linked to a DNA segment encoding a protein other than 

4 a P. methanol ica protein. 
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53, A DNA molecule according to claim 52, wherein 
said gene is a methanol -inducible gene. 
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